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This article owes everything to the ideas of Wiles, and the arguments pre- 
sented here are fundamentally his [W3], though they include both the work 
[TW] and several simplifications to the original arguments, most notably that 
of Faltings. In the hope of increasing clarity, we have not always stated 
theorems in the greatest known generality, concentrating instead on what is 
needed for the proof of the Shimura-Taniyama conjecture for semi-stable ellip- 
tic curves. This article can serve as an introduction to the fundamental papers 
[W3] and [TW], which the reader is encouraged to consult for a different, and 
often more in-depth, perspective on the topics considered. Another useful 
more advanced reference is the article [Di2] which strengthens the methods of 
[W3] and [TW] to prove that every elliptic curve that is semistable at 3 and 5 
is modular. 


Introduction 


Fermat’s Last Theorem 


Fermat’s Last Theorem states that the equation 
r” +y” =z”, ee oe) 


has no integer solutions when n is greater than or equal to 3. Around 1630, 
Pierre de Fermat claimed that he had found a “truly wonderful” proof of this 
theorem, but that the margin of his copy of Diophantus’ Arithmetica was too 
small to contain it: 


“Cubum autem in duos cubos, aut quadrato quadratum in duos 
quadrato quadratos, et generaliter nullam in infinitum ultra qua- 
dratum potestatem in duos ejusdem nominis fas est dividere; cujus 
rei demonstrationem mirabile sane detexi. Hanc marginis exiguitas 
non caperet.” 


Among the many challenges that Fermat left for posterity, this was to prove 
the most vexing. A tantalizingly simple problem about whole numbers, it 
stood unsolved for more than 350 years, until in 1994 Andrew Wiles finally 
laid it to rest. 


Prehistory: The only case of Fermat’s Last Theorem for which Fermat actu- 
ally wrote down a proof is for the case n = 4. To do this, Fermat introduced 
the idea of infinite descent which is still one the main tools in the study of 
Diophantine equations, and was to play a central role in the proof of Fermat’s 
Last Theorem 350 years later. To prove his Last Theorem for exponent 4, Fer- 
mat showed something slightly stronger, namely that the equation x++y* = z? 
has no solutions in relatively prime integers with xyz # 0. Solutions to such 
an equation correspond to rational points on the elliptic curve v? = u? — 4u. 
Since every integer n > 3 is divisible either by an odd prime or by 4, the result 
of Fermat allowed one to reduce the study of Fermat’s equation to the case 
where n = £ is an odd prime. 

In 1753, Leonhard Euler wrote down a proof of Fermat’s Last Theorem for 
the exponent £ = 3, by performing what in modern language we would call 
a 3-descent on the curve x? + y? = 1 which is also an elliptic curve. Euler’s 
argument (which seems to have contained a gap) is explained in [Edw], ch. 2, 
and [Dic1], p. 545. 

It took mathematicians almost 100 years after Euler’s achievement to han- 
dle the case Z = 5; this was settled, more or less simultaneously, by Gustav 


Peter Lejeune Dirichlet [Dir] and Adrien Marie Legendre [Leg] in 1825. Their 
elementary arguments are quite involved. (Cf. [Edw], sec. 3.3.) 

In 1839, Fermat’s equation for exponent 7 also yielded to elementary meth- 
ods, through the heroic efforts of Gabriel Lamé. Lamé’s proof was even more 
intricate than the proof for exponent 5, and suggested that to go further, new 
theoretical insights would be needed. 


The work of Sophie Germain: Around 1820, in a letter to Gauss, Sophie 
Germain proved that if Z is a prime and q = 2£+1 is also prime, then Fermat’s 
equation zf + yf = zf with exponent £ has no solutions (x,y,z) with zyz 4 0 
(mod £). Germain’s theorem was the first really general proposition on Fer- 
mat’s Last Theorem, unlike the previous results which considered the Fermat 
equation one exponent at a time. 

The case where the solution (x,y,z) to 26+ y' = z“ satisfies ryz #0 (mod 
L) was called the first case of Fermat’s Last Theorem, and the case where £ 
divides xyz, the second case. It was realized at that time that the first case 
was generally easier to handle: Germain’s theorem was extended, using similar 
ideas, to cases where kl +1 is prime and k is small, and this led to a proof that 
there were no first case solutions to Fermat’s equation with prime exponents 
£ < 100, which in 1830 represented a significant advance. The division between 
first and second case remained fundamental in much of the later work on the 
subject. In 1977, Terjanian [Te] proved that if the equation x?‘ +y” = z” has 
a solution (x,y, z), then 2@ divides either x or y, i.e., “the first case of Fermat’s 
Last Theorem is true for even exponents”. His simple and elegant proof used 
only techniques that were available to Germain and her contemporaries. 


The work of Kummer: The work of Ernst Eduard Kummer marked the 
beginning of a new era in the study of Fermat’s Last Theorem. For the first 
time, sophisticated concepts of algebraic number theory and the theory of 
L-functions were brought to bear on a question that had until then been 
addressed only with elementary methods. While he fell short of providing 
a complete solution, Kummer made substantial progress. He showed how 
Fermat’s Last Theorem is intimately tied to deep questions on class numbers 
of cyclotomic fields which are still an active subject of research. Kummer’s 
approach relied on the factorization 


(x + y)(at Gy) (a +y) = 24 


of Fermat’s equation over the ring Z[¢e] generated by the ¢th roots of unity. 
One observes that the greatest common divisor of any two factors in the prod- 
uct on the left divides the element (1 — Çe), which is an element of norm £. 


Since the product of these numbers is a perfect ¢-th power, one is tempted to 
conclude that (x +y), ... ,(@+¢)~'y) are each €th powers in the ring Z[¢e] up 
to units in this ring, and up to powers of (1 — Çe). Such an inference would be 
valid if one were to replace Z[¢e] by Z, and is a direct consequence of unique 
factorization of integers into products of primes. We say that a ring R has 
property UF if every non-zero element of R is uniquely a product of primes, 
up to units. Mathematicians such as Lamé made attempts at proving Fer- 
mat’s Last Theorem based on the mistaken assumption that the rings Z|) 
had property UF. Legend even has it that Kummer fell into this trap, al- 
though this story now has been discredited; see for example [Edw], sec. 4.1. In 
fact, property UF is far from being satisfied in general: one now knows that 
the rings Z[¢¢] have property UF only for £ < 23 (cf. [Wal], ch. 1). 

It turns out that the full force of property UF is not really needed in the 
applications to Fermat’s Last Theorem. Say that a ring R has property U Fẹ 
if the following inference is valid: 


ab = 2", and gcd(a,b) = 1 = a and b are lth powers up to units of R. 


If a ring R has property UF’, then it also has property U Fy, but the converse 
need not be true. Kummer showed that Fermat’s last theorem was true for 
exponent £ if Z[¢,| satisfied the property UF; (cf. [Wa]). The proof is far from 
trivial, because of difficulties arising from the units in Z[¢)] as well as from 
the possible failure of property UF. (A number of Kummer’s contemporaries, 
such as Cauchy and Lamé, seem to have overlooked both of these difficulties 
in their attempts to prove Fermat’s Last Theorem.) 

Kummer then launched a systematic study of the property U Fp for the 
rings Z|¢,|. He showed that even if Z[¢;| failed to have unique factorization, 
it still possessed unique factorization into prime ideals. He defined the ideal 
class group as the quotient of the group of fractional ideals by its subgroup 
consisting of principal ideals, and was able to establish the finiteness of this 
class group. The order of the class group of Z[¢z], denoted he, could be taken 
as a measure of the failure of the ring Z[¢;] to satisfy UF. It was rather 
straightforward to show that if Z did not divide hy, then Z|¢;| satisfied the 
property U F;. In this case, one called £ a regular prime. Kummer thus showed 
that Fermat’s last theorem is true for exponent ¢ if £ is a regular prime. 

He did not stop here. For it remained to give an efficient means of com- 
puting hy, or at least an efficient way of checking when £ divides hy. The class 
number hy can be factorized as a product 


he = hthy, 


where h is the class number of the real subfield Q(¢,)*, and họ is defined as 
h/h}. Essentially because of the units in Q(¢;)*, the factor hf is somewhat 
difficult to compute, while, because the units in Q(¢,)* generate the group of 
units in Q(¢e) up to finite index, the term hy can be expressed in a simple 
closed form. Kummer showed that if £ divides h}, then £ divides hy. Hence, £ 
divides he if and only if £ divides h; . This allowed one to avoid the difficulties 
inherent in the calculation of h}. Kummer then gave an elegant formula for hy 
by considering the Bernoulli numbers B,,, which are rational numbers defined 


by the formula 
£ Bry 
Se oa 


He produced an explicit formula for the class number h; , and concluded that 
if £ does not divide the numerator of Bə, for 1 < i < (£ — 3)/2, then £ is 
regular, and conversely. 

The conceptual explanation for Kummer’s formula for hy lies in the work 
of Dirichlet on the analytic class number formula, where it is shown that hy 
can be expressed as a product of special values of certain (abelian) L-series 


OO 


L(s,x) = >> x(n)n-* 


associated to odd Dirichlet characters. Such special values in turn can be 
expressed in terms of certain generalized Bernoulli numbers B,,,, which are 
related to the Bernoulli numbers B; via congruences mod £. (For more details, 
see [Wal.) 

These considerations led Kummer to initiate a deep study relating congru- 
ence properties of special values of L-functions and of class numbers, which 
was to emerge as a central concern of modern algebraic number theory, and 
was to reappear — in a surprisingly different guise — at the heart of Wiles’ 
strategy for proving the Shimura-Taniyama conjecture. 


Later developments: Kummer’s work had multiple ramifications, and led 
to a very active line of enquiry pursued by many people. His formulae re- 
lating Bernoulli numbers to class numbers of cyclotomic fields were refined 
by Kenneth Ribet [R1], Barry Mazur and Andrew Wiles [MW], using new 
methods from the theory of modular curves which also play a central role in 
Wiles’ more recent work. (Later Francisco Thaine [Th] reproved some of the 
results of Mazur and Wiles using techniques inspired directly from a reading 
of Kummer.) In a development more directly related to Fermat’s Last Theo- 
rem, Wieferich proved that if £2? does not divide 251 — 1, then the first case 
of Fermat’s Last Theorem is true for exponent £. (Cf. [Ri], lecture VIII.) 
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There were many other refinements of similar criteria for Fermat’s Last 
theorem to be true. Computer calculations based on these criteria led to a 
verification that Fermat’s Last theorem is true for all odd prime exponents less 
than four million [BCEM], and that the first case is true for all £ < 8.858- 107° 
[Su]. 

The condition that £ is a regular prime seems to hold heuristically for about 
61% of the primes. (See the discussion on p. 63, and also p. 108, of [Wa], for 
example.) In spite of the convincing numerical evidence, it is still not known 
if there are infinitely many regular primes. Ironically, it is not too difficult to 
show that there are infinitely many irregular primes. (Cf. [Wa].) 

Thus the methods introduced by Kummer, after leading to very strong 
results in the direction of Fermat’s Last theorem, seemed to become mired in 
difficulties, and ultimately fell short of solving Fermat’s conundrum!. 


Faltings’ proof of the Mordell conjecture: In 1985, Gerd Faltings [Fa] 
proved the very general statement (which had previously been conjectured 
by Mordell) that any equation in two variables corresponding to a curve of 
genus strictly greater than one had (at most) finitely many rational solutions. 
In the context of Fermat’s Last Theorem, this led to the proof that for each 
exponent n > 3, the Fermat equation x” + y” = z” has at most finitely many 
integer solutions (up to the obvious rescaling). Andrew Granville [Gra] and 
Roger Heath-Brown [HB] remarked that Faltings’ result implies Fermat’s Last 
Theorem for a set of exponents of density one. 

However, Fermat’s Last Theorem was still not known to be true for an 
infinite set of prime exponents. In fact, the theorem of Faltings seemed ill- 
equipped for dealing with the finer questions raised by Fermat in his margin, 
namely of finding a complete list of rational points on all of the Fermat curves 
x” +y” = 1 simultaneously, and showing that there are no solutions on these 
curves when n > 3 except the obvious ones. 


Mazur’s work on Diophantine properties of modular curves: Although 
it was not realized at the time, the chain of ideas that was to lead to a proof 
of Fermat’s Last theorem had already been set in motion by Barry Mazur 
in the mid seventies. The modular curves Xo(@) and X, (£) introduced in 
section 1.2 and 1.5 give rise to another naturally occurring infinite family 
of Diophantine equations. These equations have certain systematic rational 
solutions corresponding to the cusps that are defined over Q, and are analogous 


'However, W. McCallum has recently introduced a technique, based on the method 
of Chabauty and Coleman, which suggests new directions for approaching Fermat’s Last 
Theorem via the cyclotomic theory. An application of McCallum’s method to showing the 
second case of Fermat’s Last Theorem for regular primes is explained in [Mc]. 


to the so-called “trivial solutions” of Fermat’s equation. Replacing Fermat 
curves by modular curves, one could ask for a complete list of all the rational 
points on the curves X9(@) and X,(¢). This problem is perhaps even more 
compelling than Fermat’s Last Theorem: rational points on modular curves 
correspond to objects with natural geometric and arithmetic interest, namely, 
elliptic curves with cyclic subgroups or points of order £. In [Maz1] and [Maz2], 
B. Mazur gave essentially a complete answer to the analogue of Fermat’s Last 
Theorem for modular curves. More precisely, he showed that if £ # 2,3,5 
and 7, (i.e., X1(@) has genus > 0) then the curve X,(¢) has no rational points 
other than the “trivial” ones, namely cusps. He proved analogous results for 
the curves Xo(£) in [Maz2], which implied, in particular, that an elliptic curve 
over Q with square-free conductor has no rational cyclic subgroup of order £ 
over Q if £ is a prime which is strictly greater than 7. This result appeared a 
full ten years before Faltings’ proof of the Mordell conjecture. 


Frey’s strategy: In 1986, Gerhard Frey had the insight that these construc- 
tions might provide a precise link between Fermat’s Last Theorem and deep 
questions in the theory of elliptic curves, most notably the Shimura Taniyama 
conjecture. Given a solution a‘ + bf = cf to the Fermat equation of prime 
degree £, we may assume without loss of generality that af = —1 (mod 4) and 
that bf = 0 (mod 32). Frey considered (following Hellegouarch, [He], p. 262; 
cf. also Kubert-Lang [KL], ch. 8, §2) the elliptic curve 


E : y? = r(x — af)(x + b°). 


This curve is semistable, i.e., it has square-free conductor. Let E[é] denote the 
group of points of order £ on E defined over some (fixed) algebraic closure Q 
of Q, and let L denote the smallest number field over which these points are 
defined. This extension appears as a natural generalization of the cyclotomic 
fields Q(¢e) studied by Kummer. What singles out the field L for special 
attention is that it has very little ramification: using Tate’s analytic description 
of E at the primes dividing abc, it could be shown that L was ramified only at 2 
and £, and that the ramification of L at these two primes was rather restricted. 
(See theorem 2.15 of section 2.2 for a precise statement.) Moreover, the results 
of Mazur on the curve Xo(¢) could be used to show that L is large, in the 
following precise sense. The space E|] is a vector space of dimension 2 over the 
finite field F, with ¢ elements, and the absolute Galois group Go = Gal (Q/Q) 
acts F,-linearly on E|¢]. Choosing an F;-basis for E[¢], the action is described 
by a representation 


PEL: Gal (L/Q) —> GL2( Fe). 


Mazur’s results in [Maz1] and [Maz2] imply that pry is irreducible if £ > 7 
(using the fact that E is semi-stable). In fact, combined with earlier results 
of Serre [Se6], Mazur’s results imply that for £ > 7, the representation pe is 
surjective, so that Gal (L/Q) is actually isomorphic to GLə(Fp) in this case. 


Serre’s conjectures: In [Se7], Jean-Pierre Serre made a careful study of mod 
L Galois representations J : Gg —> GLə(F;¢) (and, more generally, of repre- 
sentations into GL2(k), where k is any finite field). He was able to make very 
precise conjectures (see section 3.2) relating these representations to modular 
forms mod £. In the context of the representations Jg e that occur in Frey’s 
construction, Serre’s conjecture predicted that they arose from modular forms 
(mod £) of weight two and level two. Such modular forms, which correspond to 
differentials on the modular curve X0(2), do not exist because Xo(2) has genus 
0. Thus Serre’s conjecture implied Fermat’s Last Theorem. The link between 
fields with Galois groups contained in GLa(F¢) and modular forms mod £ still 
appears to be very deep, and Serre’s conjecture remains a tantalizing open 
problem. 


Ribet’s work: lowering the level: The conjecture of Shimura and Taniya- 
ma (cf. section 1.8) provides a direct link between elliptic curves and modular 
forms. It predicts that the representation Jg e obtained from the ¢-division 
points of the Frey curve arises from a modular form of weight 2, albeit a form 
whose level is quite large. (It is the product of all the primes dividing abc, 
where a‘ + b = c is the putative solution to Fermat’s equation.) Ribet [R5] 
proved that, if this were the case, then Pz,» would also be associated with a 
modular form mod £ of weight 2 and level 2, in the way predicted by Serre’s 
conjecture. This deep result allowed him to reduce Fermat’s Last Theorem to 
the Shimura-Taniyama conjecture. 


Wiles’ work: proof of the Shimura-Taniyama conjecture: In [W3] 
Wiles proves the Shimura-Taniyama conjecture for semi-stable elliptic curves, 
providing the final missing step and proving Fermat’s Last Theorem. After 
more than 350 years, the saga of Fermat’s Last theorem has come to a spec- 
tacular end. 

The relation between Wiles’ work and Fermat’s Last Theorem has been 
very well documented (see, for example, [R8], and the references contained 
therein). Hence this article will focus primarily on the breakthrough of Wiles 
[W3] and Taylor-Wiles [TW] which leads to the proof of the Shimura-Taniyama 
conjecture for semi-stable elliptic curves. 


From elliptic curves to -adic representations: Wiles’ opening gambit 
for proving the Shimura-Taniyama conjecture is to view it as part of the more 


general problem of relating two-dimensional Galois representations and mod- 
ular forms. The Shimura-Taniyama conjecture states that if Æ is an elliptic 
curve over Q, then E is modular. One of several equivalent definitions of mod- 
ularity is that for some integer N there is an eigenform f = >> anq” of weight 
two on To(N) such that 


#E(F,>) =p+1—a, 


for all but finitely primes p. (By an eigenform, here we mean a cusp form 
which is a normalized eigenform for the Hecke operators; see section 1 for 
definitions. ) 

This conjecture acquires a more Galois theoretic flavour when one considers 
the two dimensional ¢-adic representation 


PEL: Go TE G Lo(Zo) 


obtained from the action of Gg on the -adic Tate module of E: TE = 


lim E[/”"|(Q). An ¢-adic representation p of Gg is said to arise from an eigen- 


form f = >> anq” with integer coefficients an if 
tr (p(Frob p)) = ap, 


for all but finitely many primes p at which p is unramified. Here Frob , is a 
Frobenius element at p (see section 2), and its image under p is a well-defined 
conjugacy class. 

A direct computation shows that #E(F,) = p+ 1 — tr (pz ¢(Frob,)) for 
all primes p at which pg, is unramified, so that Æ is modular (in the sense 
defined above) if and only if for some @, pge arises from an eigenform. In 
fact the Shimura-Taniyama conjecture can be generalized to a conjecture that 
every f-adic representation, satisfying suitable local conditions, arises from a 
modular form. Such a conjecture was proposed by Fontaine and Mazur [FM]. 


Galois groups and modular forms 


Viewed in this way, the Shimura-Taniyama conjecture becomes part of a much 
larger picture: the emerging, partly conjectural and partly proven correspon- 
dence between certain modular forms and two dimensional representations 
of Gg. This correspondence, which encompasses the Serre conjectures, the 
Fontaine-Mazur conjecture, and the Langlands program for G L2, represents a 
first step toward a higher dimensional, non-abelian generalization of class field 
theory. 
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Two-dimensional representations of Gg: In the first part of this century, 
class field theory gave a complete description of GF, the maximal (continu- 
ous) abelian quotient of Gg. In fact the Kronecker-Weber theorem asserts that 
Gy = II, Z%, and one obtains a complete description of all one-dimensional 
representations of Gg. In the second half of this century much attention has 
focused on attempts to understand the whole group Gg, or more precisely to 
describe all its representations. Although there has been a fair degree of suc- 
cess in using modular forms to construct representations of Go, less is known 
about how exhaustive these constructions are. The major results in the lat- 
ter direction along these lines are the work of Langlands [L12] and the recent 
work of Wiles ([W3] completed by [TW]). Both concern two-dimensional rep- 
resentations of Gg and give significant evidence that these representations are 
parametrised (in a very precise sense) by certain modular forms. The purpose 
of this article is to describe both the proven and conjectural parts of this the- 
ory, give a fairly detailed exposition of Wiles’ recent contribution and explain 
the application to Fermat’s Last theorem. To make this description somewhat 
more precise let us distinguish three types of representation. 


Artin representations and the Langlands-Tunnell theorem: Contin- 
uous representations p : Gg — GL2(C) are called (two-dimensional) Artin 
representations. Such representations necessarily have finite image, and are 
therefore semi-simple. We restrict our attention to those which are irreducible. 
They are conjectured to be in bijection (in a precise way) with certain new- 
forms (a special class of eigenforms). Those p which are odd (i.e. the deter- 
minant of complex conjugation is —1), should correspond to weight 1 holo- 
morphic newforms. Those which are even should correspond to certain non- 
holomorphic (Maass) newforms. Two partial but deep results are known. 


(a) (Deligne-Serre) If f is a holomorphic weight one newform then the cor- 
responding Artin representation can be constructed ({DS]). 


(b) (Langlands-Tunnell) If p is a two dimensional Artin representation with 
soluble image then the corresponding modular form exists ([L12] and 
[Tu]). 


The proof of the latter result is analytic in nature, invoking the trace formula 
and the theory of L-functions. 


(-adic representations and the Fontaine-Mazur conjecture: By an 4- 
adic representation we shall mean any continuous representation p : Gg — 
GL2(K) which is unramified outside a finite set of primes and where K is a 
finite extension of Q; (generalizing slightly the notion of ¢-adic representation 
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that was introduced before). Given a holomorphic newform f one can attach 
to f a system of ¢-adic representations, following Eichler, Shimura, Deligne and 
Serre. These ¢-adic representations are called modular. The Fontaine-Mazur 
conjecture (see [FM]) predicts if p is an odd, irreducible, ¢-adic representation 
whose restriction to the decomposition group at £ is well enough behaved, 
then p is modular. (The restriction on the behaviour of the representation 
on the decomposition group at £ is essential in this conjecture; it is not true 
that all odd, irreducible two dimensional ¢-adic representation are modular.) 
Before Wiles’ work almost nothing was known about this conjecture, except 
that certain very special cases could be deduced from the work of Hecke, 
Langlands and Tunnell. 


Mod £ representations and Serre’s conjecture: A mod £ representation 
is a continuous representation J : Gg —> GL»(F,). For example if E/Q is an 
elliptic curve then the action of Gg on the ¢-division points of E gives rise to a 
mod £ representation Pz, which is just the reduction modulo £ of pge. One can 
use the work of Eichler, Shimura, Deligne and Serre to associate to each mod 
L eigenform a mod £ representation of Gg. The mod £ representations which 
arise in this way are called modular. Serre has conjectured [Se7] that every 
odd (absolutely) irreducible mod £ representation is modular and should arise 
from a mod £ eigenform with certain very specific properties. This conjecture 
can be thought of as having two parts. 

The first asserts that every odd irreducible mod £ representation is modular. 
About this very little is known. It is known for p : Gg —> GLo(F2) by work 
of Hecke. It is also known for J : Gg —> GL»(F3). This latter result is an 
application of the Langlands-Tunnell theorem using the two accidents that 
there is a section to the homomorphism G'L2(Z[V/—2]) —> GL2(F3) and that 
GL»2(F3) is soluble. Partial results for p : Go — GL2(F5) follow from Wiles’ 
work. 

Given a mod £ representation arising from a mod £ eigenform, the second 
part of Serre’s conjecture predicts the minimal weight and level for that mod 
L eigenform. Here the situation is much better. There has been a lot of work 
over the last decade (including ideas from Mazur, Ribet, Carayol and Gross) 
and the problem is nearly completely resolved (see [Dil]). As was pointed 
out earlier, Ribet’s contribution [R5] implies that, if one can show that the 
Galois representation Jg arising from the (semi-stable) Frey curve attached 
to a solution of Fermat’s equation with exponent £ is modular, then one can 
show that this representation does not exist—because it would be modular of 
weight two and level two— and hence one can deduce Fermat’s Last Theorem. 

However we have seen that to show ge is modular it suffices to show that 
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for some £o, the @o-adic representation pg, is modular. In particular it suffices 
to verify that either pg3 or pgs is modular. Hence the Shimura-Taniyama 
conjecture can be reduced to (part of) the Fontaine-Mazur conjecture for 0 = 3 
and 5. We have seen that for these primes part of Serre’s conjecture is known, 
so it turns out it suffices to prove results of the form “Serre’s conjecture for £ 
implies the Fontaine-Mazur conjecture for @”. This is the direction of Wiles’ 
work, although nothing quite this general has been proven yet. 


Deformation theory: Thus the problem Wiles faces is to show that if p is 
an odd ¢-adic representation which has irreducible modular reduction p and 
which is sufficiently well behaved when restricted to the decomposition group 
at £, then p is modular. In fact he only proves a weakened version of such a 
result, but one which is sufficient to conclude that all semistable elliptic curves 
are modular. 

Wiles approaches the problem by putting it in a more general setting. On 
the one hand he considers lifts of p to representations over complete noetherian 
local Z:-algebras R. For each finite set of primes X, one can consider lifts of 
type X; these are lifts which are well-behaved on a decomposition group at £, 
and whose ramification at primes not in © is rather restricted. In particular, 
such a lift is unramified outside X US where S is the set of ramified primes 
of p. A method of Mazur (see |Maz3]) can then be used to show that if Ø is 
absolutely irreducible, then there is a representation 


ps’ : Gg — GL2(Rz) 


which is universal in the following sense. If p : Gg —> GLo(R) is a lift of p of 
type J, then there is a unique local homomorphism Ry —> R such that p is 
equivalent to the pushforward of p¥”’. Thus the equivalence classes of type © 
lifts to GL2(R) can be identified with Hom(Ry, R). The local ring Ry is called 
the universal deformation ring for representations of type X. 

On the other hand Wiles constructs a candidate for a universal modular 
lifting of type X 

pare ; Go = GL(Ts). 


The ring Ty is constructed from the algebra of Hecke operators acting on 
a certain space of modular forms. The universal property of Ry gives a 
map Ry — Ty. The problem thus becomes: to show that this map is an 
isomorphism”. In fact, it can be shown to be a surjection without great. dif- 


?Maps of this kind were already considered in [Maz3] and [BM], and it is conjectured in 
[MT] that these maps are isomorphisms in certain cases, though not in exactly the situations 
considered by Wiles. 
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ficulty, and the real challenge is to prove injectivity, i.e., to show, in essence, 
that Ry is not larger than Ty. 

By an ingenious piece of commutative algebra, Wiles found a numerical 
criterion for this map to be an isomorphism, and for the ring Ts to be a 
local complete intersection. This numerical criterion seems to be very close 
to a special case of the Bloch-Kato conjecture [BK]. Wiles further showed 
(by combining arguments from Galois cohomology and from the theory of 
congruences between modular forms) that this numerical criterion was satisfied 
if the minimal version Tø of this Hecke algebra (obtained by taking © = Q, i.e., 
allowing the least possible amount of ramification in the deformations) was a 
complete intersection. Finally in [TW] it was proved that Tg is a complete 
intersection. 


Outline of the paper 


Chapter 1 recalls some basic notions from the classical theory of elliptic curves 
and modular forms, such as modular forms and modular curves over C and Q, 
Hecke operators and g-expansions, and Eichler-Shimura theory. The Shimura- 
Taniyama conjecture is stated precisely in section 1.8. 

Chapter 2 introduces the basic theory of representations of Go. We describe 
Mazur’s deformation theory and begin our study of the universal deformation 
rings using techniques from Galois cohomology and from the theory of finite 
flat group schemes. We also recall some basic properties of elliptic curves, 
both to explain Frey’s argument precisely and illustrate the uses of (-adic 
representations. 

Chapter 3 explains how to associate Galois representations to modular 
forms. We then describe what was known and conjectured about associating 
modular forms to Galois representations before Wiles’ work. After introducing 
the universal modular lifts of certain mod £ representations, we give the proof 
of Wiles’ main theorems, taking for granted certain results of a more technical 
nature that are proved in the last two chapters. 

Chapter 4 explains how to prove the necessary results concerning the struc- 
ture of Hecke algebras: the generalization by Taylor and Wiles of a result of 
de Shalit, and the generalization by Wiles of a result of Ribet. 

Chapter 5 establishes the fundamental results from commutative algebra 
discovered by Wiles, following modifications of the approach of Wiles and 
Taylor-Wiles proposed by Faltings and Lenstra. 
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1 Elliptic curves and modular forms 


1.1 Elliptic curves 


We begin with a brief review of elliptic curves. A general reference for the 
results discussed in this section is [Sil] and [Si2]. 

An elliptic curve E over a field F is a proper smooth curve over F of genus 
one with a distinguished F-rational point. If E/F is an elliptic curve and if w 
is a non-zero holomorphic differential on Æ/F then E can be realised in the 
projective plane by an equation (called a Weierstrass equation) of the form 


(W)  Y?Z+a,XYZ+a3YZ? = X? + a2X?Z +a,X Z’ + aZ’ 


such that the distinguished point is (0 : 1 : 0) (sometimes denoted oo because 
it corresponds to the “point at infinity” in the affine model obtained by setting 


Z = 1) and w = PE T We also define the following quantities associated 
to (W): 
bə = at + 4a b4 = 2a4 + a1a3 be = ar + 4ag 


bs = a?as + 4agag — a103đ4 + a203 = a; 


One can check that the equation (W) defines an elliptic curve if and only if 
A is nonzero. One can also check that such equations define elliptic curves 
which are isomorphic over F if and only if they give the same quantity j. Thus 
j only depends on E so we will denote it jz. The quantity A depends only 
on the pair (E,w) so we shall denote it A(E,w). If u belongs to F* then 
u? A(E, uw) = A(E, w). 

An elliptic curve E/F has a natural structure of a commutative algebraic 
group with the distinguished F-rational point as the identity element. 

An algebraic map between two elliptic curves which sends the distinguished 
point of one to the distinguished point of the other is automatically a morphism 
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of algebraic groups. A map between elliptic curves which has finite kernel (and 
hence, is generically surjective) is called an isogeny. 


Elliptic curves over C: If F = C, then the curve E is isomorphic as a 
complex analytic manifold to the complex torus C/A, where A is a lattice 
in C, i.e., a discrete Z-submodule of C of rank 2. The group law on E(C) 
corresponds to the usual addition in C/A. In terms of A, an affine equation 
for E in A?(C) is given by 


Y? = 4r? + gor + 93, 


g=—60 `` z g =-140 *` T 


AcA—{0} AEA—{O} 


where 


In terms of this equation, the map from C/A to E(C) sends z to (x,y) = 
(olz), o'(z)), where p(z) is the Weierstrass g-function associated to the lattice 
A. (Cf. [Sil], ch. VI.) The inverse map is given by integrating the holomorphic 
differential w, i.e., sending P € E(C) to the image of fw in C/A, where y is 
any path on E(C) from oo to P, and A is the lattice of periods J, w, where y 
ranges over the integral homology Hı(E(C), Z). Replacing w by uw changes 
A to uA, so that A is determined by E only up to homotheties. We scale A so 
that one of its Z-generators is 1, and another, 7, has strictly positive imaginary 
part. This gives the analytic isomorphism: 


E(C) ~ C/(1,7). 


The complex number 7 in the complex upper half plane H is well defined, 
modulo the natural action of SZ2(Z) on H by Mobius transformations. (Thus 
the set of isomorphism classes of elliptic curves over C can be identified with 
the quotient H/SL2(Z).) 

The map z + e?"” identifies C/(1,7) with C*/q”%, where q = e?""" is the 
multiplicative Tate period. The analytic isomorphism 


E(C) ~ C* /q” 


has the virtue of generalizing to the p-adic setting in certain cases, as we will 
see shortly. 

Note that |g] < 1. The invariant j can be expressed in terms of q by a 
convergent power series with integer coefficients: 


j = q7 +744 + 196884q +-->. (1.1.1) 


The following basic facts are a direct consequence of the analytic theory: 
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Proposition 1.1 The subgroup E|n|(C) of points of order n on E(C) is iso- 
morphic (non-canonically) to Z/nZ x Z/nZ. More generally, if F is any field 
of characteristic zero, the subgroup E[n] (F) is contained in Z/nZ x Z/nZ. 


Proof: The analytic theory shows that Æ(C) is isomorphic as an abstract group 
to a product of two circle groups, and the first statement follows. The second 
statement follows from the Lefschetz principle (cf. [Sil], ch. VI, 86). o 


Proposition 1.2 The endomorphism ring End ¢(E) of an elliptic curve over 
C is isomorphic either to Z or to an order in a quadratic imaginary field. The 
same is true if one replaces C by any field of characteristic 0. 


Proof: An endomorphism of E(C) ~ C/A induces multiplication by complex 
number a on the tangent space. Hence End ¢(£) is isomorphic to the ring 
of a € C satisfying aA C A. Such a ring is isomorphic either to Z or to a 
quadratic imaginary order. The corresponding statement for fields of charac- 
teristic 0 follows as in the proof of proposition 1.1. 

If Endc(£) ® Q is a quadratic imaginary field, we say that Æ has complex 
multiplication. 


Remark 1.3 It follows from the arithmetic theory of complex multiplication 
(cf. [Si2], ch. 1) that any elliptic curve Æ with complex multiplication is defined 
over an abelian extension of the quadratic imaginary field K = End c(E) @Q. 
If E is defined over Q, then K has class number one. There are only finitely 
many elliptic curves over Q with complex multiplication, up to “twists” (i.e., 
C-isomorphism). 


Elliptic curves over Q,: Now suppose that F is an elliptic curve defined 
over the p-adic field Q,. There is an equation 


(wm) VOL +a XYZ Pa Z? = X?’ ag aaa + aZ’ 


for E with the property a; € Z, for all ¿ and |A| is minimal amongst all such 
equations for Æ. Although (W™'®) is not unique, the associated discriminant 
depends only on E and is denoted AĮ™, Moreover the reduction of (W™”) 
modulo the uniformizer p defines a projective curve Æ, which is independent 
of the particular minimal equation chosen. If (W) is any equation for E with 
coefficients in Zp and with discriminant A, then A'#™ divides A. 

If Ẹ is a smooth curve we say that E has good reduction at p. If E has a 
unique singular point which is a node we say that E has multiplicative reduction 
at p. Otherwise E has a unique singular point which is a cusp and we say that 
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E has additive reduction at p. If E has good or multiplicative reduction we 
say that it has semi-stable reduction at p, or simply that E is semz-stable. 

If (W) defines a smooth curve mod p then Æ has good reduction at p 
and (W) is a minimal equation. If A = 0 mod p but b3 4 24b4 mod p, then 
modulo p the equation (W) defines a curve with a node. In this case Æ has 
multiplicative reduction at p and (W) is a minimal equation. 


Curves with good reduction: In that case p does not divide A™, and the 
reduction £ is an elliptic curve over F,,. 

If q is any power of p, and F, is the field with q elements, we define the 
integer N, to be the number of solutions to the equation (W™") in the projec- 
tive plane P?(F,). Thus N, is the order of the finite group E(F,). We define 
the integer a, by the formula 


ag =Qt1—N,. 
The integers a, are completely determined by a,: more precisely, we have 


(1 = app À +p”)! = 1 + agp ahs app” + app” ETER 
(1.1.2) 


We call the expression on the left the (local) L-function associated to Æ 
over Q,, and denote it by L(E/Qp, s). Concerning the size of a, we have the 
following fundamental result of Hasse, whose proof can be found in [Sil], ch. 
V, 81: 


Theorem 1.4 |a,| < 2\/p. 


A further division among curves of good reduction plays a significant role 
in our later discussion. We say that Æ has (good) ordinary reduction if p does 
not divide ap, and that it has supersingular reduction if p divides ap. 

When F has good reduction at p, we define its local conductor at p to be 
m (E) = 0. 


Curves of multiplicative reduction: Elliptic curves over Q, which have multi- 
plicative reduction at p can be understood by using the p-adic analytic de- 
scription discovered by Tate. More precisely, we can formally invert the power 
series (1.1.1) expressing j in terms of q, to obtain to a power series for q in 
jt, having integer coefficients: 


q = j7} +7447? + 750420}? + 872769632j~4 + --- 
(1.1.3) 
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If E has multiplicative reduction, then j € Q, is non-integral, and hence the 
power series (1.1.3) converges, yielding a well-defined value of q in pZ,. This 
is called Tate’s p-adic period associated to E over Q,. Note that we have 
vyla) = —vp(j) = vA g). 

We say that E has split (resp. non-split) multiplicative reduction at p if 
the two tangent lines to the node on E(F,) have slopes defined over F, (resp. 


2) 


Proposition 1.5 (Tate) There is a p-adic analytic isomorphism 
}: Qž/ — EQ), 
which has the property that 
o(®(x)) = Plor), Yoe GQ,» 
where ô : Go, — +1 is 
e the trivial character, if E has split multiplicative reduction; 


e the unique unramified quadratic character of Gg,, if E has non-split 
multiplicative reduction. 


The proof of this proposition is explained in [Si2], ch. V, for example. 
We define the L-function L(E/Q,, s) to be 


(1—p-*)~! if E has split reduction, 
—s\—1 


L(E/Qp, 8) = l (1+ p~’) if E has non-split reduction. 


(1.1.4) 


In both cases the conductor m,(£) is defined to be 1. 


Curves of additive reduction: If E has additive reduction at p, we simply define 
L(E/Q,, 8) = 1. (1.1.5) 


The conductor mp (E) is defined to be 2, if p > 3. When p = 2 or 3, it is 
determined by a somewhat more complicated recipe, given in [Ta]. 


Elliptic curves over Q: Let E be an elliptic curve defined over Q. In 
particular Æ may be viewed as a curve over Q, for every p, and we define its 


(global) conductor by 
Ng Z [[e™”: 
p 
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The curve EF is said to be semi-stable if it is semi-stable over all p-adic fields 
Qp. Note that E is semi-stable if and only if its conductor Nz is square-free. 

Using the fact that Q has class number 1, one can show that E has a global 
minimal Weierstrass model (W™") which gives the equation of a minimal 
Weierstrass model over each Q,. The associated discriminant, denoted A#™, 
depends only on Æ. The associated differential, denoted wX"", is called the 
Néron differential. It is well-defined up to sign. 

The following, known as the Mordell-Weil theorem, is the fundamental 
result about the structure of the group of rational points E(Q). (Cf. For 
example [Sil].) 


Theorem 1.6 The group E(Q) is a finitely generated abelian group. Hence 
EQ) STEZ, 


where T is the (finite) torsion subgroup of E(Q), and r > 0 is the rank of E 
over Q. 


Concerning the possible structure of T, there is the following deep result of 
Mazur, a variant of which also plays a crucial role in the proof of Fermat’s 
Last Theorem: 


Theorem 1.7 If E/Q is an elliptic curve, then its torsion subgroup is iso- 
morphic to one of the following possibilities: 


Link, 1<n<10, n=12, Z/2nZ x Z/2Z, 1<n<4. 


The proof is given in [Maz1] (see also [Maz2]). Thanks to this result, the 
structure of the torsion subgroup T is well understood. (Recently, the tech- 
niques of Mazur have been extended by Kamienny [Kam] and Merel [Mer] to 
prove uniform boundedness results on the torsion of elliptic curves over general 
number fields. ) 

Much more mysterious is the behaviour of the rank r. It is not known if r 
can be arbitrarily large, although results of Mestre [Mes] and Nagao [Na] show 
that it is greater or equal to 13 for infinitely many elliptic curves over Q. It 
turns out that many of the deep results on E(Q) and on r are based on the 
relation with L-functions. 

We define the global L-function of the complex variable s by: 


L(E/Q,s) = | | LZ/Q, 8). (1.1.6) 
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Exercise 1.8 Using theorem 1.4, show that the infinite product defining the 
L-function L(EÆE/Q, s) converges absolutely on the right half plane Real(s) > 
3/2. 


Conjecture 1.9 (Birch-Swinnerton-Dyer) The L-function L(E/Q, s) has 
an analytic continuation to the entire complex plane, and in particular is ana- 
lytic at s = 1. Furthermore: 


ords=ıL(E/Q, s) =r. 


There is also a more precise form of this conjecture, which expresses the leading 
coefficient of L(E/Q, s) at s = 1 in terms of certain arithmetic invariants of 
E/Q. For more details, see [Sil], conj. 16.5. 

As we will explain in more detail in section 1.8, the analytic continuation 
of L(E/Q,s) now follows from the work of Wiles and Taylor-Wiles and a 
strengthening by Diamond [Di2], for a very large class of elliptic curves over 
Q, which includes all the semi-stable ones. 


Abelian varieties: Elliptic curves admit higher-dimensional analogues, called 
abelian varieties, which also play a role in our discussion. Analytically, the set 
of complex points on an abelian variety is isomorphic to a quotient C9/A, 
where A is a lattice in CI of rank 2g, satisfying the so-called Riemann period 
relations. A good introduction to the basic theory of abelian varieties can be 
found in [CS] and [Wel]. 


1.2 Modular curves and modular forms over C 


Modular curves: The group SL2(Z) of two by two integer matrices of deter- 
minant one acts by fractional linear (Mobius) transformations on the complex 
upper half plane 

H = {z €C | Im(z) > 0}, 


equipped with its standard complex analytic structure. The principal con- 
gruence group T(N) of level N is the subgroup of matrices in SL2(Z) which 
reduce to the identity matrix modulo the positive integer N. A subgroup T of 
SL2(Z) is called a congruence group if it contains T(N) for some N. The level 
of T is the smallest N for which this is true. The most important examples of 
congruence groups are: 


e The group I'o(V) consisting of all matrices that reduce modulo N to an 
upper triangular matrix. 
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e The group I'|(V) consisting of all matrices that reduce modulo N to a 


matrix of the form ( : i ) 


e The principal congruence group I (N) of level N consisting of all matrices 
that reduce modulo N to the identity. 


Notice the natural inclusions of normal subgroups T(N) C Ti(N) Cc ToN). 
The quotient [9(N)/T,(NV) is canonically isomorphic to (Z/NZ)* via 


< i) dod, 
c d 


For any subgroup H of (Z/NZ)*, we denote by T(N) the group of matrices 
in [9(V) whose image in ro(N)/T1(N) belongs to H. 

If T is a congruence subgroup of SL2(Z), define Yp to be the quotient of 
the upper half plane H by the action of I. One equips Yr with the analytic 
structure coming from the projection map m : H —> Yr. (More precisely, if 
y = (rT), and G, CT is the stabilizer of 7 in I, then the local ring Oy,., is 
identified with the local ring of germs of holomorphic functions at 7 which are 
invariant under the action of G,.) This makes Yp into a connected complex 
analytic manifold of dimension one, i.e., a Riemann surface. If lis To(N) (resp. 
Ti (N), or T(N)), we will also denote Yr by Yo(V) (resp. Yi(V), or Y(N)). One 
compactifies Yp by adjoining a finite set of cusps which correspond to orbits 
of P!(Q) = QU {oo} under F. Call Xp the corresponding compact Riemann 
surface. (For more details, notably on the definition of the analytic structure 
on Xp at the cusps, see for example [Kn], p. 311, or [Shi2], ch. 1.) It follows 
from the definition of this analytic structure that the field Kr of meromorphic 
functions on Xr is equal to the set of meromorphic functions on H satisfying 


e (Transformation property): f(y7) = f(T), for all y E€ T; 


e (Behaviour at the cusps): For all y € SL2(Z), the function f(y7) has a 
Puiseux series expansion X`% a,q"/" in fractional powers of q = e?""”. 


Riemann’s existence theorem (cf. for example [For], ch. 2) asserts that the 
analytic structure on Xr comes from an algebraic one, i.e., the field Kr is a 
finitely generated extension of C of transcendence degree 1. Thus we can, and 
will, view Xp as a complex algebraic curve over C. If T is Po(V) (resp. Pi (V), 
or [(N)), we will also denote Xp by Xo(N) (resp. X(N), or X(N)). 


Examples and exercises: 
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1. For N = 1, the curve Xo(N) = X(N) = X(N) is a curve of genus 0, and 
its field of functions is the ring C(j), where 7 is the classical modular function, 


j(r) = qt +7444 196884g+---, q= e". 


(Cf., for example, [Se4], ch. 7.) 


2. For rT € HUP!(Q) = H, define G, to be the stabilizer of r in PSL2(Z), 
and let e- = #(G,/(G, NT)). Show that e, depends only on the T-orbit 
of r in H, and that e, = 1 for all but finitely many rt in H/T. Using the 
Riemann-Hurwitz formula (cf. [Ki], sec. 4.3) show that the genus of Xp is 
given by 


qh Site (Psnelle--y ea): 


Use this to compute the genus of Xo(p), Xı(p), and X(p) for p prime. For 
details, see [Shi2], sec. 1.6 or [Ogg]. 


3. For I =I[(2), show that Xr is isomorphic to Pt, and that Yp is isomorphic 
to Pt — {0,1,00}. Show that ['/(+1) is the free group on the two generators 


s= (B2) and ge = (39). 


4. Define a homomorphism ['(2) — Z/nZ x Z/nZ, by sending gı to (1,0) 
and gə to (0,1), and let I denote its kernel. Show that I is not in general a 
congruence subgroup and that the curve Yp := H/T is birationally isomorphic 
to the Fermat curve of degree n with affine equation z” + y” = 1. 


Moduli interpretations: The points in Yr = H/T can be interpreted as 
elliptic curves over C with some extra “level N” structure. More precisely, 


e If IT = [,(N), then the T-orbit of r € H corresponds to the complex 
torus Æ = C/(1,r) with the distinguished cyclic subgroup of order N 
generated by ne Thus, points on Yo(V) parametrize isomorphism classes 
of pairs (E, C) where E is an elliptic curve over C and C is a cyclic 
subgroup of E of order N. 


e If T = T(N), then the [-orbit of r corresponds to the complex torus 
E = C/(1, T) with the distinguished point of order N given by x Hence, 
points on Yı(N) parametrize isomorphism classes of pairs (E, P) where 
now P is a point on E of exact order N. 


Remark 1.10 One checks that the above rules set up a bijection between 
points on Yp and elliptic curves with the appropriate structures, and that the 


24 


projection Y;(N) — Yo(NV) sending r1(N)r to To(N)r becomes the “forget- 
ful” map sending (E, P) to (E, (PY). 


Remark 1.11 (This remark will be used in section 1.3 when discussing Hecke 
operators.) Define an n-isogeny of I-structures to be an n-isogeny of the 
underlying elliptic curves which sends one I-structure to the other. If p is a 
prime not dividing N, then there are exactly p + 1 distinct p-isogenies from 
(C/(r,1), 4), whose images are the pairs: 


(c/(*1),5) (i=0,...,p-1),  (C/lpr.1), 2). 


If p divides N, then there are only p distinct p-isogenies from (C/(r, 1), 4), 
since (C/ (pr, 1), $) is not a I (N )-structure (the point p/N not being of exact 
order N on the complex torus C/(pr, 1)). 


Modular forms: Let k be an even positive integer. A modular form of weight 
k on T is a holomorphic function f on H satisfying: 


e (Transformation property): f(yr) = (cr+d)* f(r), for all y= e t) E€ 
r. 


e (Behaviour at the cusps): For all y € PSL2(Z), the function (cr + 
d)—* f(yT) has a Puiseux series expansion X> a,g"/" in fractional powers 
of q = e°", We call X anq™* the Fourier expansion of f at the cusp 
y7} (ioo). 


A modular form which satisfies the stronger property that the constant 
coefficient of its Fourier expansion at each cusp vanishes is called a cusp form. 
We denote by M,(T) the complex vector space of modular forms of weight k 
on I’, and by S(T) the space of cusp forms on I. (For examples, see [DI], 
sec. 2.2 and the references therein, especially, [Shi2], ch. 2.) 

This article is mainly concerned with modular forms of weight 2, and hence 
we will focus our attention on these from now on. A pleasant feature of the case 
k = 2 is that the cusp forms in S(T) admit a direct geometric interpretation 
as holomorphic differentials on the curve Xr. 


Lemma 1.12 The map f(T) > wy := 2nif(r)dr is an isomorphism between 
the space S(T) and the space Q'(Xy) of holomorphic differentials on the curve 
Xp. 
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Sketch of proof: One checks that the transformation property satisfied by f(T) 
under T causes the expression f(T)dr to be T-invariant, and that the condition 
of vanishing at the cusps translates into holomorphicity of f(7)dr. (Note, for 
example, that 2ridr = dq/q, so that wy is holomorphic at ioo precisely when 
f(q) vanishes at q = 0.) 
As a corollary, we find: 


Corollary 1.13 The space S(T) is finite-dimensional, and its dimension is 
equal to the genus g of Xr. 


Proof: This follows directly from the Riemann-Roch theorem, cf. [Ki], sec. 6.3. 
O 

To narrow still further the focus of our interest, we will be mostly concerned 
with the cases T = [9(N) and T,(V). A slightly more general framework is 
sometimes convenient, so we suppose from now on that I satisfies 


Ti(N) CP CT o(N). 


Such a group I is necessarily of the form [y(V) for some subgroup H of 
(Z/NZ)*. Because the transformation T + 7 + 1 belongs to [ the forms 
in S(T) are periodic functions on H of period 1, and hence their Fourier 
expansions at ioo are of the form 


f(T) = Sod q = e an EC. 


n>0 


The Petersson inner product: The spaces S(T) are also equipped with a 
natural Hermitian inner product given by 


H= ga f orno f IONA 


where T = x + iy. This is called the Petersson inner product. 


The diamond operators: Suppose now that T = T(N) and let d be an ele- 
ment of (Z/NZ)*. The map (d) which sends an elliptic curve with [-structure 
(E, P) to the pair (E, dP) gives an automorphism of Yr which extends to Xp. 


It is called the diamond operator. For tT in H and y = k t) in To(N), we 


have 
(d) TT) = r7). 
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Hence (d) acts on S(T), identified with the holomorphic differentials on Xp, 
by the rule 


pf OP +b 
aro = eras (EZ). 
In geometric terms, the diamond operators are the Galois automorphisms of 
the natural (branched) covering X;(N) —> Xo(N) whose Galois group is 
isomorphic to [o(N)/(£TP1(N)) = (Z/NZ)*/(+1). Given an even Dirichlet 
character x : (Z/NZ)* —+ C%, say that f is a modular form of level N and 
character x if it belongs to the x-eigenspace in S2(Tı(N)) under this action. 
Let S(N,x) denote the space of all such forms. Thus a function f in S2(N, x) 
is a cusp form on [';(N) which satisfies the stronger transformation property: 


f (==) = y(d)(er + d)? f(r), for all (24) ET (N). 


Note that if 1 is the trivial character, then Sə(N,1) is canonically identified 
with S2(To(N)), which we will also denote by S(N). Finally note the direct 
sum decomposition: 


So(Pi(V)) = 8yS2(N, x), 


where the sum ranges over all the even Dirichlet characters modulo N. 


Exercise 1.14 Show that if f(T) belongs to S2(N), then f(ar) belongs to 
So(mN), for each integer a dividing m. 


Jacobians of modular curves: Let V be the dual space 
V = S(T)“ := Hom(S2(T), ©). 


It is a complex vector space of dimension g = genus(Xr). The integral ho- 
mology A = Hı(Xr, Z) maps naturally to V by sending a homology cycle c 
to the functional ¢, defined by ¢.(f) = f ws. The image of A is a lattice in 
V, i.e., a Zmodule of rank 2g which is discrete (cf. [Mul], cor. 3.8). Fix a 
base point 7 € H, and define the Abel-Jacobi map as : Xp(C) — V/A by 
asl Pf) = S wy. Note that this is well-defined, i.e., it does not depend on 
the choice of path on Xr from 7) to P, up to elements in A. 

We extend the map ay by linearity to the group Div (Xr) of divisors on 
Xp, and observe that the restriction of a; to the group Div °(Xp) of degree 
0 divisors does not depend on the choice of base-point 7). Moreover we have 
the Abel-Jacobi theorem: 
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Theorem 1.15 The map 
az : Div°(Xp) — V/A 


has a kernel consisting precisely of the group P(Xr) of principal divisors on 
Xp. Hence ay induces an isomorphism from Pic °(Xp) := Div °(Xp)/P(Xp) 
to V/A. 


For the proof, see [Mul], ch. 3. The quotient V/A is a complex torus, and is 
equal to the group of complex points of an abelian variety. We denote this 
abelian variety by Jp, the Jacobian variety of Xp over C. If T = [o(N) or 
T(N), we will also write Jo(N) or Jı(N) respectively for the Jacobian Jr. 


1.3 Hecke operators and Hecke theory 
We maintain our running assumption that I satisfies 
(N) CT CTN). 


If p is a prime not dividing the level N, we define the Hecke operator T, on 
S(T) by the formula 


We give a more conceptual description of T, in terms of remark 1.11 in the 
case I = T(N). We have 


wni) = >, Gi (wy), 


where ¢;(T) = and @.(T) = (p)pr represent the p + 1 curves with T- 
structure that are images of (C/(r, 1), 4) by a p-isogeny, and the ¢* are the 
pull-back maps on differential forms on H. (An isogeny of elliptic curves with 
I-structure is simply an isogeny between the underlying curves which sends 
one [-structure to the other.) Such a description makes it evident that T,(f) 
belongs to S(T), if f does. In terms of the Fourier expansion of f = X` ang”, 


the formula for the operator T, on S2(N, x) is given by: 


Tolf) = do ang”? + px(p) X ang". 


pin 
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If p divides N, then we define the Hecke operator U, analogously, by summing 
again over all the cyclic p-isogenies of I-structures. Since there are only p of 
them, the formula becomes simpler: 


The reader is invited to check that the Hecke operators of the form T, or 
U, commute with each other, and also that they commute with the diamond 
operators introduced in the previous section. 

We extend the definition of the Hecke operators to operators Ty», with 
n > 1, by the inductive formulae 


Lat = Tl pe =p pl pts, if (p, NA, 


and Tp» = Uy’ otherwise. We then define the operator Tp, where n = [| př is 
written as a product of powers of distinct primes p;, by 


= Its. 


This definition makes the Hecke operators multiplicative, i.e., TmIn = Tmn if 
(m,n) = 1. (A more conceptual definition of the Hecke operator Tn is that 
T,(f) is obtained by summing the pullback of wy over the maps describing 
all the cyclic n-isogenies of [-structures.) The relations among the different 
Hecke operators can be stated succinctly by saying that they obey the following 
(formal) identity: 


I-i + (pp?) [[G - Upp) 7 = SO Tan. 


pAN pIN (1.3.1) 


The reader can consult [DI], sec. 3 and the references therein (especially [Shi2], 
ch. 3 or [Kn]) for more details and different points of view on Hecke opera- 
tors. Let T be the subring of Ende(S-(T)) generated over C by all the Hecke 
operators T, for pN, U, for q|N, and (d) acting on S(T). 


Definition 1.16 A modular form f is an eigenform if it is a simultaneous 
eigenvector for all the Hecke operators in T, i.e., if there exists a C-algebra 
homomorphism A : T — C such that Tf = A(T) f, for all T € T. 


A direct calculation shows that the coefficients a, of an eigenform f can be 
recovered from the homomorphism A by the formula: 


anl f) = ai(f)ACTh)- 
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It follows that the first Fourier coefficient a, of a non-zero eigenform is always 
non-zero, and that the non-trivial eigenspaces for T are all one-dimensional: 


Proposition 1.17 Given a non-zero algebra homomorphism A : T — C, 
there is exactly one eigenform f up to scaling, which satisfies Tf = A(T)f, 
for all T ET. 


Sketch of proof. The proof of the existence of f is an exercise in commutative 
algebra (localize S(T) at the kernel of A), and the uniqueness is clear from 
the formula above. 

We call an eigenform satisfying a; = 1 a normalized eigenform. 


Atkin-Lehner theory: It is natural to ask whether S2(T) can be decomposed 
into a basis consisting of distinct normalized eigenforms. Unfortunately, this 
is not always possible, as the following exercise illustrates. 


Exercise 1.18 Suppose that p? divides N exactly. Let T’ be the algebra of 
Hecke operators (generated by the operators T} with qJN/p*, and U, with 
q|N/p*) acting on So(N/p?). Let f = X7; ang” be a T’-eigenform of level 
N/p? in S2(N/p?). Show that the space S spanned by the forms f(T), f(pr), 
f(p?r), and f(p?r) is contained in S2(N), and is stable for the action of the 
Hecke operators T}, g{N, and U,, q|N. Show that Sp has no basis of simulta- 
neous eigenforms for the Hecke algebra T of level N, so that the action of T 
on Sy is not semi-simple. 


Let T° denote the subalgebra of T generated only by the “good” Hecke oper- 
ators T} with qJN, and (d). 


Proposition 1.19 If q does not divide N, the adjoint of the Hecke operator 
T, with respect to the Petersson scalar product is the operator (q)~'T,, and the 
adjoint of (q) is (q)~*. In particular, the Hecke operators commute with their 
adjoints. 


Proof: See [Kn], th. 9.18 and 8.22, or [Ogg]. 
Proposition 1.19 implies, by the spectral theorem for commuting operators 
that commute with their adjoints: 


Proposition 1.20 The algebra T° is semi-simple (i.e, it is isomorphic to a 
product C x --- x C of a certain number of copies of C), and there is a basis 
of S2(V) consisting of simultaneous eigenvectors for the operators T}. 
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Thus, T° has the merit of being semi-simple, while T is not in general. The 
cost of replacing T by T°, however, is that one loses “multiplicity one”, i.e., 
the eigenspaces for T° need not be one-dimensional. For example, the space 
Sy defined in the previous exercise is contained in a single eigenspace for T°. 

The theory of Atkin-Lehner [AL] gives essentially a complete understanding 
of the structure of the algebra T, and the structure of the space of eigenforms. 
To motivate the main result, observe that the problem in the exercise above 
seems to be caused by forms of level N that are coming from forms of lower 
level N/p? by a straightforward operation, and are therefore not “genuinely” 
of level N. They are the analogues, in the context of modular forms, of non- 
primitive Dirichlet characters. 


Definition 1.21 We define the old subspace of S2(T) to be the space spanned 
by those functions which are of the form g(az), where g is in Sa(Tı(M)) for 
some M < N and aM divides N. We define the new subspace of S(T) 
to be the orthogonal complement of the old subspace with respect to the 
Petersson scalar product. A normalized eigenform in the new subspace is 
called a newform of level N. 


The following result is the main consequence of the theory of Atkin-Lehner. It 
gives a complete answer to the question of what is the structure of the algebra 
T acting on S(T). 

Theorem 1.22 If f is in the new subspace of S(T‘) and is an eigenvector for 
all the operators in T°, then it is also an eigenform for T, and hence is unique 


up to scaling. More generally, if f is a newform of level Ny|N, then the space 
Sr defined by 


Ss = {g € S(T) such that Tg =2,;(T)g, for all T € T°} 


is stable under the action of all the Hecke operators in T. It 1s spanned by the 
linearly independent forms f(az) where a ranges over the divisors of N/N}. 
Furthermore, we have 

S2(T) = fS, 
where the sum is taken over all newforms f of some level Ny dividing N. 


See [AL] for the proof in the case r = Fo(N), and [La2], ch. VIII for the 
general case. (See also [DI], sec. 6 for an overview.) 


Exercise 1.23 Consider the case where [ = I'9(22). Show that X(22) is of 
genus 2, and hence that S2(22) has dimension 2. Show that $(22) is equal 
to Sp, where f = (n(7)n(117))? is a newform of level 11, so that in particular 
there are no newforms of level 22 on I. Show that T° is isomorphic to C in 
this case, and that T is isomorphic to the semisimple algebra C x C. 
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Action on homology and Jacobians: Note that the Hecke operators act 
on V = S(T)“ by duality. One checks (cf. [Kn], props. 11.23, 11.24) that the 
sublattice A of V is stable under the action of all the Hecke operators Th, and 
of the diamond operators (d). Therefore the operators Ta and (d) give rise 
to endomorphisms of the torus V/A, and hence the Jacobian variety Jr, in 
a natural way. The involution T ++ —7 gives rise to an involution on Xp(C) 
(which is complex conjugation on the model of Xp over R deduced from the 
Q model defined in section 1.5). Since complex conjugation is continuous it 
also acts on the integral homology A = Hı(Xr(C), Z). Let At and A` be the 
sublattices of A on which complex conjugation acts by +1 and —1. These are 
sublattices of A of rank g which are stable under the Hecke operators, since 
complex conjugation commutes with the Hecke action. 

A more algebraic description of the action of T, on Jp is given via the 
notion of an algebraic correspondence. A correspondence on a curve X is a 
divisor C on X x X which is taken modulo divisors of the form {P} x X and 
X x {Q}. Let mı and m2 denote the projections of X x X onto each factor. 
Then the correspondence C induces a map on divisors of X, by setting 


C(D) = ta(n] (D) -C). 


(For the definition of the intersection Dı - Da of two divisors, see [Wel].) The 
map C preserves divisors of degree 0, and sends principal divisors on X to 
principal divisors. It gives a well defined algebraic endomorphism of the Jaco- 
bian variety Jac (X). Given a correspondence C, its transpose C” is defined to 
be the divisor of X x X obtained by interchanging the two factors of X x X. 
One can define a natural notion of composition for correspondences, and the 
set of correspondences forms a ring. The general theory of correspondences 
and the proofs of the above facts are given in [Wel], particularly the second 
chapter. 

The Hecke correspondence T, is defined as the closure in Xp x Xp of the 
locus of points (A, B) in Yr x Yr such that there is a degree n cyclic isogeny of 
elliptic curves with [-structure from A to B. For example, if p is a prime not 
dividing N, then T, is an algebraic curve in X(N) x X(N) which is birational 
to Xp,()nro(p): The induced map on divisors in this case satisfies 


T,((E, P)) = X _(E/C, P mod C) 


where the sum runs over the subgroups C of E having order p. Note also that 
if (A, B) belongs to Tp, then the isogeny dual to A — B gives a p-isogeny 
from B to (p)A, so that 


1.4 The L-function associated to a cusp form 


For this section, let f in Sə(Tı(N)) be a cusp form with Fourier expansion 
at ioo given by )>,, dng”. One has the following estimate for the size of the 
Fourier coefficients an: 


Theorem 1.24 The coefficients a, € C satisfy the inequality 


Jan| < e(f)oo(n) Vn, 


where c(f) is a constant depending only on f, and co(n) denotes the number 
of positive divisors of n. 


Sketch of proof: This follows from proposition 1.51 of section 1.7 which relates 
the p-th Fourier coefficients of eigenforms, for p a prime not dividing the level 
of T, to the number of points on certain abelian varieties over the finite field 
Fp. The estimates of Hasse and Weil for the number of points on abelian 
varieties over finite fields (stated in theorem 1.4 of section 1.1 for the special 
case of elliptic curves; see [Wel], §IV for the general case) thus translate into 
asymptotic bounds for the Fourier coefficients of these eigenforms. We note 
that the cruder estimate |an| = O(n), which is enough for the purposes of this 
section, can be derived by a more elementary, purely analytic argument; cf. 
[Ogg], ch. IV, prop. 16. o 
The L-function associated to f is defined by the formula: 


L(f,s) = ` ann’. 


As in exercise 1.8, one can show that the infinite sum defining L(f, s) converges 
absolutely in the right half-plane Re (s) > 3. A much better insight is gained 
into the function L(f, s) by noting that it is essentially the Mellin transform of 
the modular form f. More precisely, if we set A( f, s) = N*/?(2r)~*I'(s)L(f, s), 
then we have 


As) = N°? | playa /y (1.4.1) 
0 
Exercise 1.25 Check the formula above. 
This integral representation for L(f,s) gives the analytic continuation of 
L(f,s) to the entire complex plane. The modular invariance of f translates 


into a functional equation for L(f,s): more precisely, let wy be the Atkin- 
Lehner involution defined by wy(tT) = —1/Nr. The reader may check that 
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wy induces an involution of Xp, and hence of 01(Xp) = $(I). One finds that 
L(f,s) satisfies the functional equation: 


Alf, s) = —A(ww(f), 2 — 8). 


For a proof of this, see [Ogg], ch. V, lemma 1. Eigenforms for T in S2(N, x) 
have a great importance in the theory because their associated L-functions 
have an Euler product expansion, in addition to an analytic continuation and 
functional equation: 


Theorem 1.26 If f = >> anq” is a normalized eigenform in S(N, x) for all 
the Hecke operators, then the L-function L(f,s) = X ann™® has the Euler 
product expansion 


I] (1 — app ™® + x(p)p 78)? Jfa — app $)". 
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Proof: This follows directly from equation (1.3.1) of section 1.3. m 

If f is a newform of level N, then it is also an eigenform for wy, so that the 
functional equation may be viewed as relating L(f,s) and L(f,2— s). We can 
also state the following more precise version of theorem 1.24 (see lemma 3.2 
of [Hi2] for example for parts (b), (c) and (d)). 


Theorem 1.27 Suppose that f is a newform of level Ny and let N, denote 
the conductor of its character x. 


(a) If p does not divide Ny then |ap| < 2,/p. 


(b) If p|N; and p does not divide Ny then a? = xo(p) where xo is the 
primitive character associated to X. 


(c) If p divides Ny and p does not divide N;/Ny then |ap| = \/p. 
(d) If p° divides Np and p divides N/N; then ap = 0. 


1.5 Modular curves and modular forms over Q 


Modular curves: For I between ['9(N) and F(N), the modular curve Xp 
has a model over Q. We describe such a model in the case of IT = ro(N); the 
construction for general I follows from similar considerations. 

The key remark here is that, as was noted in section 1.2, the complex 
points on the curve Yo(.V) have a natural interpretation as moduli of elliptic 
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curves together with a cyclic subgroup of order N, given by sending the point 
T €H/To(N) to the pair (C/(1,7), (1/N)). 

Consider the “universal elliptic curve” 

36 1 
J= 7 —1728° 
It is an elliptic curve over the function field Q(j), with j-invariant j. Let d be 
the order of P!(Z/NZ), and let Ci,... , Ca denote the set of all cyclic subgroups 
of E; of order N, defined over Q(j), an algebraic closure of Q(j). Fix one of 
these subgroups, C. The Galois group Gal (Q(j)/Q(j)) permutes the C; in a 
natural way. Let Fy be the smallest extension of Q(j) (viewed as embedded in 
Q(j)) with the property that o(C) = C, for all o € Gal(Q(j)/F wn). It can be 
seen (cf. [Shi2], thm. 6.6) that the Galois action on the C; is transitive so that 
Fy/Q(j) is of degree d, and that it is a regular extension, i.e., Fy NQ = Q. 
Geometrically, Fy can be viewed as the function field of a curve X/Q over Q, 
with the inclusion of Q(j) corresponding to a map from Xq to the projective 
“j-line” over Q. The pair (E;, C) is an elliptic curve over Q(j) with a subgroup 
of order N defined over Fy. Using (£;,C), each complex point of X gives an 
elliptic curve over C with a subgroup of order N, provided the point does 
not lie over j = 0, 1728 or co. The resulting map to Xoọ(N) extends to an 
isomorphism from X to Xo(N). The curve X thus constructed, together with 
this identification, is the desired model of Xo(N) over Q. 

More concretely, the functions j = j(7) and jy = j(NT) are related by a 
polynomial equation ®y(j, jn) = 0 with coefficients in Q, of bidegree d. The 
field Fy is the function field of the affine curve ®y(X, Y) = 0, and the mapping 
T => (j(T),j(NT)) gives a birational equivalence between H/To(N) and the 
complex curve defined by the equation ®y. In practice it is not feasible to 
write down the polynomial ®y, except for certain very small values of N. To 
study the models over Q of Xo(N), more indirect methods are needed, which 
rely crucially on the moduli interpretation of Xo(V). Similar remarks hold for 
X(N). 


Models over Z: The work of Igusa [Ig], Deligne-Rapoport [DR], Drinfeld [Dr], 
and Katz-Mazur [KM] uses the moduli-theoretic interpretation to describe a 
canonical proper model for Xr over Spec Z. These models allow us to talk 
about the reduction of Xp over finite fields F,, for p prime. The curve has 
good reduction at primes p not dividing N, with the “non-cuspidal” points of 
Xr/r, corresponding to elliptic curves over F, with I-structure. The singular 
fibers at primes p dividing N can also be described precisely; an important 
special case (see [DR]) is that of Po(NV) with p exactly dividing N. For further 
discussion and references, see [DI], sec. 8, 9. 


E; : y? + zy = r? — 
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From now on, when we write Xp, Xo(N), or X(N), we will mean the curve 
over Q which are the models described above for the complex curves defined 
in section 1.2. 


Remark 1.28 When considering g-expansions, it is more convenient to use 
a different set of models over Z for these complex curves. We define X¥(N) 
in the case of Tı(N) as a model over Z which parametrizes pairs (Ei) where 
i is an embedding of uyn in the (generalized) elliptic curve Æ. (So X#(N) is 
the model denoted X,,(V) in [DI], sec. 9.3, assuming N > 4.) For I between 
To(N) and Ty (V), we define Xf as the corresponding quotient of Xj'(N). This 
model has good reduction at primes p not dividing N, but unlike the models 
mentioned above, its fibers at primes p dividing N are smooth and irreducible, 
but not proper. In the case of T = P(N), the curve Xf go can be identified 
with Xo(N). However, this is not the case in general: the cusp œ is a rational 
point of Xf but not necessarily of Xr. 


Jacobians: Weil’s theory [Wel] of the Jacobian shows that the Jacobians Jr 
defined in section 1.2 as complex tori also admit models over Q. When we 
speak of Jp, Jo( N) and J,(N) from now on, we will refer to these as abelian 
varieties defined over Q. Thus, the points in Jr(K), for any Q-algebra K, are 
identified with the divisor classes on Xr of degree 0, defined over K. 

We let Jr/z, denote the Néron model of the Jacobian Jr over Spec (Z). 
Using this model we define Jp;, for arbitrary rings A. In particular we can 
consider Jr/r,, the reduction of the Jacobian in characteristic p, which is closely 
related to the reduction of the integral model of the curve Xp mentioned above. 
In particular, if p does not divide the level of I, then Jp/p, can be identified 
with the Jacobian of Xr/r,. For a treatment of the case = Io(N) with p 
exactly dividing N, see the appendix of [Maz1]; for more general discussion 
and references, see [DI], sec. 10, especially sec. 10.3. 


Hecke operators: The Hecke operators have a natural moduli interpretation, 
which was already touched upon in section 1.3. In particular, one finds that 
the operator T, arises from a correspondence which is defined over Q, and 
gives rise to an endomorphism of the Jacobian Jp which is defined over Q. 
This in turn gives rise to an endomorphism of the Néron model Jp;z, and we 
can then consider the endomorphism T, on the reduction of the Jacobian in 
characteristic p. Recall that if p is a prime not dividing N, we may identify 
this reduction with the Jacobian of Xrm,- (Cf. [MW], ch. 2, sec. 1, prop. 2.) 
Furthermore, one can show that the moduli-theoretic interpretation of the 
Hecke operator remains valid in characteristic p; i.e., the endomorphism Tn of 


36 


Jr/r, is induced by a map on divisors satisfying, for all ordinary elliptic curves 


A with [-structure: 
T,(A) = $ i(A), 

where the sum is taken over all cyclic isogenies of degree n. (See [DI], sec. 10.2 
for further discussion and references. ) 

This description allows one to analyse, for example, the Hecke operator 
Tp over Fp, when (p,N) = 1. Let us work with T = T(N), to illustrate 
the idea. For a variety X over F,, let dx be the Frobenius morphism on X 
defined by raising coordinates to the pth power. Thus if (E, P) corresponds 
to a point of X1(V)/z,, then ġp is an isogeny of degree p from (E, P) to the 
pair (Eœ, Poo) = $x,(n)(E, P). The graph of ¢x,() in (X1(N) x X1(N))/r, is 
a correspondence of degree p, which we call F. Let F’ be the transpose of this 
correspondence. The endomorphism F of Jp induced by F is the Frobenius 
endomorphism @¢,,, and the endomorphism F” is the dual endomorphism (in 
the sense of duality of abelian varieties). Now consider the divisor 


F'((E, P)) = (E1, Pi) +-+- + (Ep, Pp), 


where the (E;, P;) are elliptic curves with [-structure in characteristic p. Since 
og, is an isogeny of degree p from (Fj, P;) to (E, P), we also have the dual 
isogeny from (E£,P) to (£;,pP;). If E is ordinary at p, then the points 
(Exc, P»), (E1,pPi), ..., (Ep, pP,) are a complete list of the distinct curves 
with [’-structure which are p-isogenous to (E, P). Hence one has the equality 
of divisors on Xı(N)/r,: 


T,((B, P)) = (Eo, Poo) + (Ei, pP1) + +++ + (Ep, pPp) = (F + (p) F’)((E, P)). 


Since the ordinary points are dense on Xı(N)/r,, we deduce that T, = (F + 
(p) F’) as endomorphisms of J,(N),,. This equation, known as Eichler- 
Shimura congruence relation, plays a central role in the theory. (For more 
details, see [DI], sec. 10.2, 10.3.) 


Theorem 1.29 If pjN then the endomorphism T, of Jrj, satisfies 
Tp = F + (p)F". 


Remark 1.30 This was proved by Eichler [Ei] to hold for all but finitely many 
p in the case of ro(N), and by Shimura ([Shil], see also [Shi2], ch. 7) in the 
case of [,(.V). The fact that it holds for all p not dividing N follows from 
work of Igusa [Ig]. 
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Modular forms: In the same way that modular curves have models over 
Q and over Z, the Fourier coefficients of modular forms also have natural 
rationality and integrality properties. We start by sketching the proof of: 


Theorem 1.31 The space S(T) has a basis consisting of modular forms with 
integer Fourier coefficients. 


Proof: The Hecke operators act on the integral homology A* in a way that 
is compatible with the action on S(T) and respects the natural (Poincaré) 
duality between these two spaces. Hence, if {An }nen is a system of eigenvalues 
for the Tn, then the A, are algebraic integers in some finite extension K of Q, 
and the system {A2 }nen is a system of eigenvalues for the T, for any Galois 
automorphism c of Q/Q. Hence, we have shown: 


Proposition 1.32 If f € S2(M,x) is a newform of some level M dividing 
N, then its Fourier coefficients lie in a finite extension K of Q. Moreover, 
ifo € Gal(Q/Q) is any Galois automorphism, then the Fourier series f° 
obtained by applying o to the Fourier coefficients is a newform in S2(M, x7). 


The explicit description of S(T) given in section 1.3 implies that S2([) is 
spanned by forms having Fourier coefficients which are algebraic integers in 
some finite (Galois) extension K of Q, and that the space of forms with Fourier 
coefficients in K is stable under the natural action of Gal (K/Q) on Fourier 
expansions. An application of Hilbert’s theorem 90 shows that S(T) has a 
basis consisting of forms with rational Fourier expansions, and the integrality 
of the Fourier coefficients of eigenforms yields the integrality statement of 
theorem 1.31. 

We define S2(T, Z) to be the space of modular forms with integral Fourier 
coefficients in S2(T). Theorem 1.31 states that S2(T, Z) & C = S(T). Given 
any ring A, we define 


S(T, A) = S&T, Z) 8 A, 


and define S2(N, A) and S5(N,x, A) (where x now is a character with values 
in A*) in the obvious way. If A is contained in C, the q-expansion principle 
below allows us to identify S(T, A) with the set of modular forms in S(T) 
with Fourier coefficients in A. 


The q-expansion principle: Because the modular curve Xo(NV) has a model 
over Q, the space of modular forms S2(N) = 01(X (N)) has a natural rational 
structure, given by considering the differential forms on Xo(N) defined over 
Q. The fundamental g-expansion principle (see [DR], ch. 7, or [Kat], sec. 1.6) 
says that these algebraic structures are the same as those obtained analytically 
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by considering g-expansions at oo. More generally, using the model Xf, we 
obtain the q-expansion principle over Z for cusp forms on F (cf. [DI], sec. 12.3). 


Theorem 1.33 The map S(T) — Q!(Xp) defined by f œ wy induces an 
isomorphism from S(T, Z) to Q!(XE). Furthermore, if A is flat over Z or N is 
invertible in A, then the induced map S2(T, A) > Q! (XÈ 4) is an isomorphism. 
Furthermore, if A is a subring of C, then this isomorphism identifies S(T, A) 
with set of modular forms in S(T) having coefficients in A. 


1.6 The Hecke algebra 


It follows directly from the formulas for the Hecke operators acting on q- 
expansions that the T, leave S(To(N), Z) stable, as well as the subspace 
of Sə(N,x) with coefficients in Z[x]. Using the g-expansion principle (the- 
orem 1.33), one can also show ([DI], sec. 12.4) that the diamond operators 
preserve the spaces of cusp forms on I with integral Fourier expansions, and 
hence that the space S(T, Z) is preserved by all the Hecke operators. 

We define Tz to be the ring generated over Z by the Hecke operators Tj, 
and (d) acting on the space S(T, Z). More generally, if A is any ring, we define 
T4 to be the A-algebra Tz & A. This Hecke ring acts on the space S(T’, A) 
in a natural way. Before studying the structure of the Hecke rings T4 as we 
vary the rings A, we note the following general result (Cf. [Shi2], ch. 3.): 


Lemma 1.34 The space S(T, A)Y = Hom4(S2(T, A), A) is a free T4-module 
of rank one. 


Sketch of proof: One checks that the pairing Tz x S(T, Z) — Z defined by 
(T, f) + a(Tf) sets up a perfect, Tz-equivariant duality between Tz and 
S(T, Z). The result for arbitrary A follows. 


Hecke rings over C: If A = C, then the structure of the ring T = Tc 
is completely described by theorem 1.22. More precisely, if Tc, denotes the 
image of the Hecke algebra acting on the space Sp defined in section 1.3, then 


Tc = OT cy, 


where the direct sum ranges over all distinct newforms f of some level Nẹ 
dividing N. Furthermore, the algebra Tc, can be described explicitly. In 
particular, if f is a newform of level N then Tc, is isomorphic to C, but if Nẹ 
is not equal to N then the ring Tc,¢ need not be a semi-simple algebra over C. 

Lemma 1.34 in the case A = C says that V = S(T)“ is a free Tc-module 
of rank one, but we also have: 
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Lemma 1.35 The module S(T) is a free Tc-module of rank one. 


Proof: Let gi,... ,9: be a complete system of newforms of levels Ny, ..., Ni 
dividing N. One can check that the form 


g = (N/T) +--+ + al N/M) 


generates S(T) as a Tc-module. The map Tc — S(T) defined by T — Tg 
gives an isomorphism from Tce to S2(T), as Tc-modules. 


Remark 1.36 Lemmas 1.34 and 1.35 imply that Tc is a Gorenstein C-algebra 
(of finite rank), i.e., Homc(Tc, C) is isomorphic to Tc as a Tc-module. 


Hecke rings over Q: Let [f] be the Galois orbit (under the action of Gg = 
Gal (Q/Q)) of a normalized newform f of some level Ny dividing N, and let 
Ky be the field extension of Q generated by the Fourier coefficients of f. The 
space Dye), is a vector space of dimension [Ky : Q]loo( N/N), which is 
spanned by modular forms with rational Fourier coefficients. Let Sif) be the 
Q-subspace of forms in ®ygeif)S, with rational Fourier coefficients. The space 
Sty) is stable under the action of Tg, and letting Tos) be the image of To 
acting on Sif), we obtain the direct sum decomposition 


To = Of Tose: 


where the sum is taken over the distinct Gg-orbits of normalized newforms 
f of some level Ny dividing N. If Ny is equal to N, then the algebra To jf) 
is isomorphic to the field Kp. If Ny is a proper divisor of N, then, as in the 
complex case, the algebra Tg jf) is a (not necessarily semi-simple) algebra over 
Q of rank o9(N/Ny) [Ky : Q]. The nature of the fields Ky, and in general the 
structure of Tg, is very poorly understood at this stage; for example, one does 
not know how to characterize the number fields that occur as a Kp for some 
f (but they are all known to be totally real or CM fields). 

The ring Tg acts naturally on the rational homology Hi(Xr,Q) =A@Q, 
and we have 


Lemma 1.37 The module A ® Q is free of rank two over Tg. 


Sketch of proof. The modules At @ C ~ V and A~ & C ~ V are free of rank 
one over Tc, by lemma 1.34. This implies that At @ Q and A~ & Q are both 
free of rank one over Tg. 
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Hecke rings over Z: The ring Tz is a certain (not necessarily maximal) order 
in Tg. One still has an injection 


Tz > OT zr) 


where now Tz p denotes the ring generated over Z by the Hecke operators 
acting on Siy- Of course the structure of Tz is even more mysterious than 
that of Tg! The ring Tz acts naturally on A, but it is not the case in general 
that A is free of rank two over Tz, i.e., that the integral analogue of lemma 1.37 
is true. (See remark 1.42 below.) 


Hecke rings over Q;: The study of the algebras Tz, and Tg, arises naturally 
because of the Hecke action on the Tate module 7;(Jr), 


Ti(Jr) := lim(Jr) [e], 


where the inverse limit is taken with respect to the multiplication by @ maps. 
The action of Tz, on T;(Jr) is compatible with that of Go, and it is this pair of 
actions on the Tate module which is used to associate two-dimensional Galois 
representations to modular forms. 

It will sometimes be more convenient to consider the ring Tg, and its action 
on 


V=Ti(Jr) ®z, Qe- 


We first record a useful duality property enjoyed by the Tate modules. The 
Weil pairings on the groups Jp|[¢"] for n > 1 induce a perfect pairing 


(, ): (Jr) x Te( Jr) > Ze. 


Since each Hecke operator T is adjoint to wTw where w = wy is the Atkin- 
Lehner involution, we have the following lemma. 


Lemma 1.38 The map x +> ¢, where sly) = (x, wy) defines an isomor- 
phism of Tz,-modules, 


Te( Jr) = Te( Jr)” = Homz,(Te( Jr), Ze), 
and hence an isomorphism of Tg,-modules 
V = VY = Homo, V, Q). 


The following lemma allows us to regard Tg, as a coefficient ring for a 
two-dimensional Galois representation 


Gal (@/Q) > Aut r, (V) = GLo(Tg,). 
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Lemma 1.39 The module V is free of rank 2 over To,. 


Proof. Lemma, 1.37 implies directly that the module V = A & Q; is free of rank 
two over Tg,. 


Corollary 1.40 The ring To, is a Gorenstein Qy-algebra; i.e., 
To, = Homa, (Te,, Qe) 

is free of rank one over To,. 

Proof. Choosing a basis for V over Tg,, we obtain an isomorphism 


Te, D Te, = To, ® To- 


Decomposing Tg, as Į [; R; where each factor R; is a finite-dimensional local 
Q;-algebra, we obtain an isomorphism 


R; © R; = RY @ RY 


for each i. At least one of the four maps RY —> R; deduced from this 
isomorphism must be surjective, and by counting dimensions, we see that it 
must be injective as well. It follows that Tg, is isomorphic to TG,. 

Recall that for primes p not dividing N, the Jacobian Jp has good reduction 
mod p, and the Eichler-Shimura relation, theorem 1.29, states that on Jr/r,, 
we have 


Ip + (py 


For primes p not dividing N£, we may identify 7;(Jp) with the @adic Tate 
module of the reduction (see [ST]) and consider the Frobenius endomorphism 
F on the free rank two Tg,-module V. As a consequence of the Eichler-Shimura 
relation, we find: 


Theorem 1.41 For p not dividing N£, the characteristic polynomial of F on 
the Tg,-module V is 
X? — T,X + (pp. 


Proof: (We are grateful to Brian Conrad for showing us this argument.) Since 
FF" = p, it follows from the Eichler-Shimura relation that 


F? —T,F + (p\p = 0. 


To conclude that this is in fact the characteristic polynomial, it suffices to 
compute the trace of F. To do so, we use the Tg, isomorphism 


pay 
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defined by the modified pairing (-,w-). Under this modified pairing, F is 
adjoint to wF'w = (p)F”’, so the trace of F on V is the same as that of 
b+ ġo ((p)F’) on VY. Choosing bases for V and Tg,, one sees that this is 
the same as the trace of (p) F”. Hence 


2tr F =tr F + tr ((p)F’) = tr (Tp) = 2 Tp. 


Hecke rings over Z,: The ring Tz, is free of finite rank over Zy. It therefore 


decomposes as 
Tz, = I] Trw 


where the product runs over the maximal ideals m of Tz,, and Tm is the 
localization of Tz, at m. For each m, Tm is a complete local Zy-algebra, free 
of finite rank as a Z,-module. 


Remark 1.42 While the analogue of lemma 1.39 does not always hold for 
Tz, (see [MRi], sec. 13) we shall see that it holds for certain localizations Tm. 
Results of this type are much deeper than lemma 1.39 and were first obtained 
by Mazur [Maz1], sec. 14, 15. We shall return in chapter 4 to explain Mazur’s 
result and its generalizations, which play a role in the arguments of [W3] and 
[TW]. 


Example 1.43 The curve X0(19) has genus 1, and Xo(57) has genus 5. By 
consulting the tables in the Antwerp volume [Ant4] or Cremona’s book [Cr], 
one finds that there is exactly one newform of level 19, and that there are 
three newforms of level 57, which all have rational Fourier coefficients. Their 
Fourier coefficients a,, for the first few primes p, are listed in the following 
table: 


2) 3) 5| 7| 11| 13| 17| 19| 23| 29] 31 
19A| 0|—2| 3|—1! 3])-4/-3] 1] 0 6) —4 
57A | —2 | —1 | —3 | —5 —1|—1|—4| —2|—6 
57TB| 1} 1|—2| OF; 0| 6/-6;-1] 4 2| 8 
57C |—2| 1| 1| 3)/-3]-6]/ 3|—1| 4/-10) 2 


bä 
N 


Setting f = 19A, we find that a basis of simultaneous eigenforms in S2(57, C) 
for the Hecke operators Tp, | # 3,19, and U3, U19 is: 


fig) fer, fo) FU —vV7=2) fr), STA, 57B, 57C. 


It appears from the table that the Fourier coefficients corresponding to the 
forms 57B and 57C are congruent modulo 3. This is in fact the case. One 
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finds that the Hecke ring Tz, generated over Z3 by the Hecke operators acting 
on S2(57, Z3) is isomorphic to the subalgebra of Z$: 


{(z,y, 2,t,w) such that t=w (mod 3)}. 
The isomorphism sends T, (for p 4 3,19) to the element 
(ap(19A), a,(19A), a,(57A), ap(57 B), ap(57C)). 


It sends U3 to (—1 + v—2,—1 — V—2,—1,1,1) and Uig to (1,1, -1, -1, —-1). 
Thus there are exactly 4 distinct maximal ideals of Tz,, and the localizations 
at these maximal ideals are the rings Zs, Z3, Z3, and 


Tm = {(t, w) such thatt =w (mod 3)}, 


where m is the ideal generated by 3 and Tn — a,(57B) for all n. 


1.7 The Shimura construction 


Let f = X` anq” be an eigenform on F with (not necessarily rational) Fourier 
coefficients, corresponding to a surjective algebra homomorphism Aş : To — 
Ky, where Ky is the field generated over Q by the Fourier coefficients of f. We 
briefly review in this section a construction of Shimura ([Shi2], ch. 7) which 
associates to f (or rather, to the orbit [f] of f under Gg) an abelian variety 
A, defined over Q and of dimension |K; : Q]. 

Let I, C Tz be the ideal ker (Ay) O Tz. The image Ip(Jp) is a (connected) 
subabelian variety of Jp which is stable under Tz and is defined over Q. 


Definition 1.44 The abelian variety Aş associated to f is the quotient 
Ay = Jr/IyJr 


From this definition one sees that Aş is defined over Q and depends only on 
|f], and that its endomorphism ring contains Tz/J which is isomorphic to an 
order in Ky. 


Remark 1.45 Using theorem 1.22, one can show that Jo(V) is isogenous to 


oo(N/N) 
Lia Aur 


We now describe the abelian variety Ay as a complex torus. Let Vy be the 
subspace of V = S(T)” on which T acts by A. Theorem 1.22 and lemma 1.34 
show that Vp is a one-dimensional complex vector space. Let my be the or- 
thogonal projection of V to Vy, relative to the Petersson scalar product. The 
projector 7» belongs naturally to Tx,. 
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Let [f] be the set of all eigenforms whose Fourier coefficients are Galois 
conjugate to those of f. The number of forms in [f] is equal to the degree d 
of K; over Q. Now, we set 


Vin. = Bse Vo Tu = >, 


Tg: 
gelf] 


Note that m; is simply the orthogonal projection of V to Vig. Note also that 
Tis) belongs to the Hecke algebra To. 


Lemma 1.46 The abelian variety Ay is isomorphic over C to the complex 
torus Vip)/mp(A), with the map mp: V/A — Vip)/m (A) corresponding to 
the natural projection from Jr to Af. 


In particular, one sees that Ay is an abelian variety of dimension d = |v; : QJ. 
Hence if f has rational Fourier coefficients, then the abelian variety A; is an 
elliptic curve. This elliptic curve is called the strong modular elliptic curve 
associated to f if also f is a newform of level N and F = To(N). 


Example 1.47 If T = [9(26), one checks that the genus of Xp is two, and 
that there are two distinct normalized eigenforms fı and fə in S2(26). From 
the tables in [Ant4] or [Cr], one sees that fı and fə have integral Fourier 
coefficients, whose values for the primes < 31 are: 


| 2| 3] 5| 7] 11] 13] 17/19] 23|29| 31 
-1 —3]-1] 6] 1/-3] 2] 0] 6/-4 
HESEL TR -— 4 


[= 


fi 
f 


Hence the abelian varieties Ay, and Ap, are elliptic curves. The above table 
suggests (and this can be checked directly by looking at the equations for these 
curves given in [Ant4] or [Cr], or by using the discussion in [DO], lemma 2.1) 
that the Fourier coefficients of fı and fo are congruent modulo 2. The natural 
projection Jo(26) — Aj, 6 Af is an isogeny whose kernel is isomorphic to 
Z/2Z x Z/2Z, and Jo(26) is not isomorphic to As, 6 Af. More generally, one 
knows that a Jacobian can never decompose as a non-trivial direct product of 
two principally polarized abelian varieties, cf. [Maz1], prop. 10.6. (There are 
no non-zero homomorphisms from Ay, to Ap, and so a non-trivial product 
decomposition of Jr would have to induce a decomposition as a product of 
principally polarized abelian varieties.) 


Let (Ap) be the Tate module of the abelian variety Ar, 
TAs) := lim( Ap IC") 
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where the inverse limit is taken with respect to the multiplication by @ maps. 
This module is naturally a module for T; ® Z; , and (Af) ® Q; is a module 
for K; ® Q as well. 


Lemma 1.48 The module Tp(As)®Q is a free module of rank 2 over K ,®@Qy. 


Proof: This follows directly from lemma 1.39. 


Proposition 1.49 The algebra End g(Ays) ® Q is isomorphic to Kr. In par- 
ticular Ay is simple over Q. 


The proof is given in [R2], cor. 4.2. The main ingredient is the irreducibility of 
the Galois representation attached to Ay as in section 3.1. (Cf. theorem 3.1.) 


Properties of As: good reduction: 


Theorem 1.50 If f is an eigenform of level N, and p is a prime that does 
not divide N, then the abelian variety Ay has good reduction at p. 


Proof. This follows from the fact that Jp has good reduction at such primes, 
which in turn is a consequence of the good reduction of Xp. 

If p is a prime not dividing N, it then becomes natural to study the number 
Ny,» of points on the abelian variety Ay over the finite field F,. It is given by 
the following formula: 


Proposition 1.51 The number of points Ny» is given by the formula 
Nyp = Normg,/q(A¢(1 — apl f) + (p)p)), 
where a,(p) € Ky is the p-th Fourier coefficient of f. 
Proof: By Weil’s theory [Wel], this number is given by the determinant 
Nyy = det(1 — F), 


where F' is the Frobenius endomorphism acting on the -adic Tate module 
T,(A;). So theorem 1.41 implies 


det (1 = F) = Normg,/9(As(1 = Tp + (p)p)), 


and proposition 1.51 follows. Oo 
Defining the local Hasse-Weil L-function of Ay over F, by the formula 


L(A;/Fp,8) = det(1— Fp™*)~", 
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the proof above gives the formula: 


L(As/Fp, 8) = |] Lp(f7, 5), 


where the product is taken over all complex embeddings o : Ky — C, and 
L,(f%, 8) is the Euler factor at p in the L-function that was associated to f7 
in section 1.4. 

In particular, if Ay = E is an elliptic curve, i.e., f has rational Fourier 
coefficients and f is on Po(N), then the number of points on E over F, is 
given by the formula 


#E(F)) = p+1—a,(f). (1.7.1) 


Properties of Ay: bad reduction: A fundamental quantity associated to 
any abelian variety A over Q is its arithmetic conductor, which measures the 
amount of bad reduction of A. If we factor this conductor as a product of 
prime powers, IL, p’», then the exponents m, are equal to 0 precisely when 
A has good reduction at p. The definition of m,(A) was already given in 
section 1.1 when A is an abelian variety of dimension 1, i.e., an elliptic curve. 
In general, the exponent m, coincides with m,(p4,2) (see section 2.1 below), 
where pay is the Galois representation on the f-adic Tate module of A. 

Regarding the bad reduction of the abelian variety A p, one has the following 
consequence of the results of Langlands, Deligne and Carayol [Ca3] discussed 
below in section 3.1. 


Theorem 1.52 Iff is a newform of level N, then the conductor of the abelian 
variety Ay is equal to N°. 


1.8 The Shimura-Taniyama conjecture 


Let E be any elliptic curve defined over Q, and let N denote its arithmetic 
conductor, defined as in section 1.1. Then we have: 


Proposition 1.53 The following are equivalent: 


(a) The curve E is isogenous over Q to Ay, for some newform f on some 
congruence group I. 


(b) The curve E is isogenous over Q to Af, for a newform f on To(N). 


(c) There is a non-constant morphism defined over Q, from Xo(N) to E. 
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Sketch of proof. The implication (b) = (a) is immediate, and (a) = (b) follows 
from the work of Carayol [Ca3] (cf. proposition 3.20 below). If (b) holds, then 
there is a surjective map Jo(N) — E. Composing with the Abel-Jacobi map 
Xo(N) — Jo(N) gives a map to E which is not constant, by the definition of 
the Jacobian, and hence (c) holds. Conversely, a non-constant map of curves 
T : Xo(N) — E induces, by Albanese (covariant) functoriality, a surjective 
map of Jacobians m, : Jo( N) — E, where we have identified Æ with its own 
Jacobian in the natural way. Since Jo(NV) is isogenous to a product (possibly 
with multiplicities) of abelian varieties Ay where f run over newforms of level 
Ny dividing N (cf. remark 1.45), it follows that there is a surjective map 
Aş — E for some Af. Since A, is simple (proposition 1.49), this map is an 
isogeny, and part (b) follows. o 

We call an elliptic curve over Q satisfying the equivalent properties above 
a modular elliptic curve. A startling conjecture that was first proposed by 
Taniyama in the 1950’s and made more precise by Shimura predicts that the 
Shimura construction explained in the previous section is surjective: 


Conjecture 1.54 All elliptic curves defined over Q are modular. 


This conjecture is now known to be true for a very wide class of elliptic curves. 
The results of Wiles [W3] and Taylor-Wiles [TW] imply it is true for all semi- 
stable elliptic curves, and a strengthening of the method, [Di2], implies the 
conjecture for all elliptic curves which have semi-stable reduction at 3 and 5. 


Remark 1.55 See [DI], sec. 13 for a more thorough list of equivalent forms 
of the conjecture. 


Relation with L-functions: Define numbers a, = a,(£) as in section 1.1. 
Recall the (global) Hasse-Weil L-function defined in section 1.1 by equations 
(1.1.2), (1.1.4), (1.1.5), and (1.1.6). The following proposition reveals some of 
the importance of the Shimura-Taniyama conjecture: 


Proposition 1.56 If E is modular, i.e., is associated to a newform f by the 
Shimura construction, then 


L(E/Qs) = L(f, s). 


In particular, L(E/Q,s) has an analytic continuation to the entire complex 
plane, and a functional equation. 
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Sketch of proof: If E is isogenous to an elliptic curve Ay associated to a 
newform f on ['9(N) by the Shimura construction, then the two L-functions 
L(A;/Q, 8) and L(E/Q,s) are equal. On the other hand, formula (1.7.1) 
directly implies that L(As/Q,s) is equal to L(f,s), at least up to finitely 
many Euler factors (corresponding precisely, by the work of Igusa, to the 
primes dividing N). The equality of all the Euler factors follows from the 
work of Deligne, Langlands, and Carayol (cf. [Cal]). 

Knowing the analytic continuation and functional equation of the Hasse- 
Weil L-function of an elliptic curve is of great importance in the theory. For 
example, the conjecture of Birch and Swinnerton-Dyer (which was stated in 
a weak form in section 1.1, conjecture 1.9) relates arithmetic invariants of Æ 
such as the rank of its Mordell-Weil group and the order of its Shafarevich- 
Tate group to the behaviour at s = 1 of L(E/Q,s). (Note that s = 1 is 
outside the domain of absolute convergence of the infinite product used to 
define L(E/Q, s).) 

One consequence of this conjecture is that E(Q) is finite if L(E/Q, 1) is 
non-zero. This was proved by Coates and Wiles in [CW] for elliptic curves with 
complex multiplication, and by Kolyvagin [Kol] for modular elliptic curves. 
(Recently, a different proof for modular elliptic curves has been announced by 
K. Kato.) Thanks to the breakthroughs of [W3], [TW], and [Di2], the results 
of Kolyvagin and Kato are now unconditional for a very large class of elliptic 
curves. 


The Shimura-Taniyama conjecture for abelian varieties: We may view 
conjecture 1.54 as asserting that the map 


{ Newforms of weight 2 on Xo(V) \ g oe classes of elliptic es 
with rational Fourier coefficients over Q of conductor N 
provided by the Shimura construction is a bijection. It is natural to extend 
this result to all eigenforms, not just those having rational Fourier expansions. 
We say that an abelian variety defined over Q is modular if it is isogenous to 
an abelian variety of the form Ay for some newform f on T(N). It would 
not be reasonable to expect that all abelian varieties over Q are modular: the 
abelian varieties arising from the Shimura construction are very special, in 
many respects. Most importantly, they have a very large ring of endomor- 
phisms. Following Ribet [R7], we make the definition: 


Definition 1.57 An abelian variety over Q of dimension g is said to be of 
G Lə-type if its endomorphism ring over Q contains an order in a field of 
degree g over Q. 


Then we have the following generalized Shimura-Taniyama conjecture: 
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Conjecture 1.58 Every simple abelian variety A over Q which is of GL2-type 
is modular. 


This generalization of the Shimura-Taniyama conjecture is far from being 
proved, and tackling it still seems to require some major new ideas, even 
after the ideas introduced by Wiles. 


2 Galois theory 


2.1 Galois representations 


If F is a perfect field, we will let F denote an algebraic closure of F and 
Gr = Gal (F/F) its absolute Galois group. Recall that G p is a profinite group: 
more precisely, Gr = lim Gal (L/F) as L runs over finite Galois extensions of 


F contained in F. Hence Gp carries a natural topology. If £ is a prime 
different from the characteristic of F we will let ep : Gr — Z% denote the 
é-adic cyclotomic character, i.e. if ¢ is an ¢-power root of unity in F then 
o(¢) = C@) for all o € Gr. We will simply write e€ if the choice of £ is clear 
from the context. 


The finite fields F,: The group Gr, is isomorphic to Z, the profinite com- 
pletion of Z. A natural topological generator is provided by the Frobenius 
element Frob », the automorphism of F, which raises elements of F, to their 
p” power. If £ is different from p, then €¢(Frob p) = p. 


The p-adic fields Q,: The p-adic valuation vp : Q) —» Z extends uniquely 
to a valuation vp : Q% — Q. Let Og, denote the ring of integers of Q, and 
mg, its maximal ideal. The residue field Og,/mg, is an algebraic closure of 
Fp, which we will identify with F,. The valuation vp is compatible with the 
action of Gg,, and hence Og, and mg, are stable under Gg,. In particular 
Go, acts on Og, /mg, and we obtain a map o : Gg, — Gr,, which is in fact 
a surjection. We call the kernel the inertia group and denote it Jp. It has a 
unique maximal pro-p subgroup: the wild inertia group, denoted P,. There is 
a canonical isomorphism 


I/P, = [[ 2. 
l#p 
where the (1) indicates that if f E€ Go,/Pp is a lifting of Frobenius and if 
o €1,/P, then fo f™ = o”. 
The inertia group J, is filtered by a series of subgroups called the higher 
ramification groups. More precisely, there are closed normal subgroups 15 in 
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Gg, for any u € [—1, co}, with the following properties: 
e Ifu<v then I} D Ip, ie., the I5 form a decreasing filtration. 
e For u < 0, I5 = I, while [>° = {1}. 


e Py =Uns0 Fr: 
e y=) T” 


v<u~p* 
These groups are defined in section 3 of chapter IV of [Se2], where they are 
denoted Go, (except that there, Go, = Gg, whereas we have I> + = Ip). 
The local Kronecker-Weber theorem ([Se2], ch. 14, sec. 7, for example) 
asserts that the map 
ox g: Gy — Gr, x ZX. 


is an isomorphism. (We will use G?” to denote the abelianisation of a profinite 
group G, i.e. the unique maximal abelian continuous image.) Under it I, goes 
to Z% and for u > 0, I goes to (1 +p Z) C Z% , where [u] denotes the least 
integer greater than or equal to u. If £ Æ p then eç is trivial on J, and takes 
Frob, to p € Z. 


The field Q: If p is a rational prime, the usual p-adic valuation vp : Q* — Z 
extends to a valuation v : Q* — Q. This extension is not unique, but all 
extensions are permuted transitively by Go. Fix one such extension v, and 
let G, denote its stabiliser in Gg. Then G, acts on the completion (Q),, 
and preserves the subfield of elements algebraic over Qp. This field is an 
algebraic closure of Q,, which we shall identify with Q,. One can check that 
the resulting map Gp — Gg, is an isomorphism. Thus we obtain subgroups 
Gp D I, D I} for u € [-1, co] and a distinguished element Frob, € Gp/Ip. 
These objects depend on the choice of v, and vary by conjugation in Gg as 
this choice is varied. We call an algebraic extension F/Q unramified at p if 
all conjugates of J, lie in Gr C Gg; otherwise we say that F/Q is ramified. 
If F/Q is Galois and unramified at p, then there is a well-defined conjugacy 
class [Frob ,] C Gal (F/Q). 

By replacing the p-adic completion by an Archimedean completion, one 
has a well-defined conjugacy class [c] in Gg consisting of those elements that 
arise as complex conjugation for some embedding Q —> C. We will denote by 
Gæ the subgroup {1, c} for one such element c. 

We have the following fundamental results concerning the structure of Go 
(see for instance [La1]). 
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Theorem 2.1 If F/Q is a finite extension then F is only ramified at finitely 
many primes (those dividing the discriminant of F/Q). 


Theorem 2.2 (Hermite-Minkowski) Zf S is a finite set of primes and if 
d € Z>o then there are only finitely many extensions F/Q of degree d which 
are unramified outside S (contained in a fixed algebraic closure Q). 


Theorem 2.3 (Chebotarev) If F/Q is a Galois extension unramified out- 
side a finite set S of primes then |J gs|Frob p] is dense in Gal (F/Q). 


Theorem 2.4 (Kronecker-Weber) The product of the p-adic cyclotomic 
characters gives an isomorphism 


[eo : G2? > [zx 
Pp Pp 


We remark that the Chebotarev density theorem (theorem 2.3) applied 
to the extension of Q generated by the m'” roots of unity implies Dirichlet’s 
theorem that if n and m are coprime integers then there are infinitely many 
primes p with p = nmodm. We also remark that the Kronecker-Weber 
theorem is equivalent to the fact that the maximal abelian extension of Q is 
obtained by adjoining all roots of unity to Q. 


Representations: A d-dimensional representation of Gg is a homomorphism 
Go — GLK), 


where K is any field. (In our later discussion, we will also consider repre- 
sentations with coefficients in a ring.) Often K (and hence GLa(K)) comes 
equipped with a natural topology, and it is then customary to restrict one’s 
attention to continuous homomorphisms Gg — G Lal K). 

Since any one-dimensional representation has abelian image, theorem 2.4 
allows one to give a complete description of the one-dimensional representa- 
tions of Go together with the behaviour of these representations on the de- 
composition groups at all primes. The aim of this article is to discuss attempts 
to give a similar theory for the two-dimensional representations of Gg. 

We will call a representation p of Gg unramified at p if it is trivial on the 
inertia group J. Otherwise we say that it is ramified at p. If p is unramified 
at p, then p(Frob p) is well-defined (and its conjugacy class is independent of 
the choice of vp). 

If p is a representation of a group G and t is a non-negative integer, then 
we let A‘p denote the representation of G on the it exterior power of the 
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underlying module of p. If H is a subgroup of G, we will let p” (resp. pp) 
denote the H-invariants (resp. H-coinvariants) of the underlying module of 
p. If H is normal in G, then we shall also use p” and py to denote the 
corresponding representation of G/H. In particular if p is a prime and p is a 
representation of Gg or Gg,, then p” (Frob p) and pz, (Frob p) are well-defined. 
We will be primarily interested in three types of representation of Gg. 


e Artin representations, i.e. continuous representations Gg —> GL,(C). 


Since all compact totally disconnected subgroups of GLq(C) are finite, 
Artin representations have finite image. Hence they are semi-simple and 
are unramified at all but finitely many primes, by theorem 2.1. 


Mod £ representations, i.e. continuous representations Gg —> GLa(k), 
where k is a finite field of characteristic 4. These always have finite image 
and hence, like Artin representations, are unramified at all but finitely 
many primes. 


¢-adic representations, i.e. continuous representations Gg > GLal K), 
where K is a finite extension of Q;. We require that an ¢-adic represen- 
tation be unramified at all but finitely many primes. 


Remark 2.5 (a) Continuous representations Gg — GLa( K), unlike those 


(c) 


to GLa(C) or GLalk), may be ramified at infinitely many primes. We 
shall not need to consider such representations. (One rarely if ever does.) 
However, the term “¢-adic representation” is often used in the more 
general sense elsewhere in the literature. 


Note that the image of an adic representation can very well be infi- 
nite. For instance this is the case if d = 1 and the representation is the 
cyclotomic character. 


Note that in the case of mod £ and ¢-adic representations, the represen- 
tations need not be semi-simple. 


Proposition 2.6 Let S be any finite set of primes. 


(a) An Artin representation p : Gg —> GLa(C) is determined by the values 


of tr p(Frob,) on the primes p ¢ S at which p is unramified. 


(b) A semi-simple mod £ representation p : Gg > GLa(k) is determined by 


the values of tr \' p(Frob,) (i = 1,...,d) on the primes p € S at which 
p is unramified. If L > d then it is determined by tr p(Frob,) on the 
primes p € S at which p is unramified. 
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(c) A semi-simple ¢-adic representation p : Go —> GLal K) is determined by 
the values of tr p(Frob,) on the primes p ¢ S at which p is unramified. 


Proof: Combining the Chebotarev density theorem (theorem 2.3) with the 
continuity of p we see that we must show that tr p (resp. tr A’ p) determines p 
up to conjugacy in the various settings. For the characteristic zero represen- 
tations, see [Bour], ch. 8, sec. 12.1, prop. 3; for mod £ representations, this is 
the Brauer-Nesbitt theorem ({CR], (30.16)). 

Let K denote a finite extension of Qy, let O be its ring of integers, A the 
maximal ideal of O and k the residue field. If p : Gg —> GLa(K) is an adic 
representation, then the image of p is compact, and hence p can be conju- 
gated to a homomorphism Gg — GLa(O). Reducing modulo the maximal 
ideal À gives a residual representation p : Gg — GLalk). This representation 
may depend on the particular GL,(K )-conjugate of p chosen, but its semi- 
simplification p** (i.e. the unique semi-simple representation with the same 
Jordan-Hélder factors) is uniquely determined by p, by proposition 2.6 (b). 

Note that the kernel of the reduction map GLg(O) — GLa(k) is a pro- 
é-group. In particular we see that if p # £ then p(P,) is finite and p(P,) is 
isomorphic to p(P,). 

Suppose p is a representation of Go of one of the three sorts above. If p 4 £ 
then we define the conductor, m,(p), of p at p by 


Mp(p) = a codimp’”” du = codimp” +f codimp”’ du. 


This is well-defined as p(P,) is a finite group. If p is an Artin representation 
this makes sense also for p = l. (It would even make sense for a mod £ 
representation if p = £, but in this case it does not seem to be a useful notion.) 
It is known that m,(p) is an integer (see chapter VI of [Se2]). Moreover it is 
easily seen that m,(p) = 0 if and only if p is unramified at p. We define the 
conductor N(p) of p to be 

I] pO, 

p 


where the product is over all primes p Æ £ in the case of an 4-adic or mod £ 
representation and over all primes in the case of an Artin representation. This 
makes sense because p is unramified almost everywhere. 

The following lemma is an exercise. 


Lemma 2.7 Suppose that p : Go —> GLal K) is an L-adic representation and 
that p : Gg —> GLalk) is a reduction of p. Then for each prime p 4 € we have 
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fo codimp’? du = fY codimp’? du. Thus 


= im p/p —dim p7? 
NG) = Nol peer Seer 
pF 


and in particular N(p) divides N(p). 


We take this opportunity to introduce some important notation. Let G 
denote a group and R a ring. If p : G > GLa(R) is a representation we 
shall let M, denote the underlying R[G]-module (so that M, = R? as an R- 
module). If M is an R[G]-module we shall let End (M) denote the module of 
R-linear endomorphisms of M. It is also an R[G|-module with the G-action 
being defined by 


(go) (m) = g(¢(g-*m)). 


If M is a finitely generated free R-module we will let End°(M) denote the 
sub-R|G|-module of End (M) consisting of endomorphisms of trace zero. We 
will use adp to denote End (M,) and ad°p to denote End°(M,). Note that if 
d is invertible in R then adp ~ ad°p @ R as R[G]-modules, where R has the 
trivial action of G. Note also then that the kernel of ad°p is the same as the 
kernel of the composite map G > GLa(R) > PGL4(R). 

We also remark that if R is an algebraically closed field of characteristic 
other than 2 and p : G — G L2(R) is irreducible, then either ad°p is irreducible 
or there is a subgroup H C G of index 2 and a character y : H — R* such that 
p = IndGy. In the latter case ad°p = 6@Ind G(x/y’), where 6 is the nontrivial 
character of G/H and x’ is the composite of x and conjugation by an element 
of G — H. Moreover, either Ind G(y/x’) is irreducible or ad°p & 6; © 62 @ 63 
where the ô; are distinct characters of G of order 2. In the latter case, p is 
induced from a character from each of the subgroups H; = ker 6; of index 2 in 
G. We leave the verification of these facts as an exercise to the reader. 


2.2 Representations associated to elliptic curves 


Perhaps the simplest examples of non-abelian ¢-adic representations arise from 
elliptic curves defined over Q. 


We will let E/n|(Q) denote the group of n-torsion points on E(Q). By 


proposition 1.1 there is a non-canonical isomorphism E[n](Q) = (Z/nZ)?. 


Furthermore, E[n] (Q) carries a natural action of Go and so we get a represen- 
tation (defined up to conjugation) 


Pen: Go > GLo(Z/nZ). 
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If £ is a prime different from the characteristic of Q then we set “E = 


lim Z|] (Q), which is non-canonically isomorphic to Z?. Again it has a nat- 


ural continuous action of Gg, and so we get a representation (defined up to 
conjugation) 
PEL: Go _ GL2(Z,). 


Note that PE g — (pee mod L). 


Global properties: We have the following basic properties of the represen- 
tations gen and pry: 


Proposition 2.8 (a) The determinant of pry is ee. 


(b) The representation pry is absolutely irreducible for all £. For fixed E, 
Pre is absolutely irreducible for all but finitely many £. 


(c) If E does not have complex multiplication then pge (and hence pry) is 
surjective for all but finitely many £. 


Proof: Part (a) follows from the existence of the non-degenerate alternating 
Galois-equivariant Weil pairing 


TE x ToE — Z,(1) := lim pen. 


Part (b) is proved in [Se5], ch. IV. Part (c) is the main result of [Se6]. = 

The following deep result, which is stronger than the second assertion of 
part (b) of proposition 2.8, is due to Mazur [Maz2], thms. 1 and 2. (Parts (b) 
and (c) can actually be deduced directly from theorem 1.7, using the fact that 
a semi-stable curve with a rational subgroup of order £ is necessarily isogenous 
to a curve with a rational point of order £.) 


Theorem 2.9 Suppose that E/Q is an elliptic curve. 
(a) If £ > 163 is a prime then pry is irreducible. 
(b) If E is semistable then pry is irreducible for £ > 7. 
(c) If E is semistable and pry is trivial then pry is irreducible for £ > 3. 


Remark 2.10 Combined with the results of Serre [Se6], Mazur’s results imply 
that if Æ is semistable everywhere then ge is surjective for 0 > 7 ([Maz2], 
thm. 4). 


Local behaviour of ppg and pry: 
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Proposition 2.11 Suppose E has good reduction at p. 
(a) If€ Ap, then pre is unramified at p, and we have the formula 
tr py e(Frob yp) = p+1— #E,(Fp). 


In particular tr ppe(Frob,) belongs to Z and is independent of £ 4 p. 
(b) For alln > 1 there is a finite flat group scheme Fa/Zp such that 
E[p"|(Q) = Fn (Qp) 
as Gp-modules. 


(c) œ If E has good ordinary reduction at p (i.e., ap is not divisible by p) 
then E,|p\(F>) has order p and 


Ep * 
AEE 


e If E has supersingular reduction at p, then Ep|p|(Fp) is trivial and 
PE pla, is irreducible. 


Proofs: The proof of part (a) can be found in chapters V and VII of [Si1]. 
For part (b) one considers the finite flat group scheme F, = €[p"], where E 
is the model for E over Z, defined by (W™"). (See [Sha] for an introduction 
to theory of finite flat group schemes.) Part (c) follows from the results in 
chapters V and VII of [Sil] together with basic results on finite flat group 
schemes. 


Proposition 2.12 Suppose that E has multiplicative reduction at p. Let q = 
dep E PZp be the multiplicative Tate period attached to E, defined as in sec- 
tion 1.1. Let ô: Gp/Ip > {£1} be the unique non-trivial unramified quadratic 
character of Gp if E has non-split multiplicative reduction, and let 0 be the 
trivial character if E has split multiplicative reduction. Then 


(a) 
PEsla, ~ (4 1) 8 ð, 
and if 0A p then mp(pge) = 1. 
(b) 
Peela, ~ (4 1) 8 ð, 
and Y € H'(Gg,,Z/lZ(1)) = QX /(Qs)’ corresponds to the Tate period 
dap © Q g) 
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(c) If€ # p then mpre) = 0 (i.e., Pre is unramified at p) if and only if 
tulak”) = =le). 


(d) There is a finite flat group scheme F/Z, such that E[p|(Q,) = F(Q,) if 
and only if pvp AR”) = —vp(jz).- 
Sketch of proof: By Tate’s proposition 1.5 of section 1.1, we have 
E(Q,) = (QF /4*)(5) 


as Gp-modules. (The (ô) indicates that G, acts on (Q*/q*)(6) by o : z => 
o(x)°\).) The proposition follows directly from this, together with [Edi], prop. 
8.2. for the last part. (For further details, see ch. VII, prop. 5.1 of [Sil], ch. 
IV, thm. 10.2. and ch. V of [Si2], and sec. 2.9 of [Se7].) 


Proposition 2.13 Suppose that E has additive reduction at p. Then for any 
L Æ p the conductor of prela, is at least 2, and if p > 3 then it is equal to 2. 


Remark 2.14 For an elliptic curve E with any type of reduction at p, the 
conductor of pela, (for l # p) coincides with the local conductor of E at p 
given by a formula of Ogg (proved by Saito [Sa] in the case p = 2). In particular 
the conductor of pzy|qg, is independent of ¢ # p. For further discussion and 
references, see ch. IV, secs. 10 and 11 of [Si2]. 


The Frey curve: We now consider the elliptic curve 
Eap:iY?Z=X(X —AZ)(X + BZ), 


where A and B are non-zero coprime integers with A+ B # 0. Set C = 
A+B. This equation has discriminant A = 16(ABC)?. If p is an odd prime 
then this curve has good reduction at p if p{(ABC) and has multiplicative 
reduction at p if p ABC. Thus Ag, = 2^" (ABC)? for some n € Z>o. 
If for instance A = —1 mod 4 and B = 0 mod 16 then E4, g has a minimal 
Weierstrass equation 


B-A-1l1 
Y°Z+XYZ=X’+ = — (AB/16)XZ?. 
In this special case F4, g has semi-stable reduction everywhere and AR. = 
278(ABC}’°. 


The connection with Fermat’s Last Theorem is provided by the elliptic 


curve 
E:Y°Z=X(X —afZ)(x +bfZ) 
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considered by Hellegouarch [He] and Frey [Fr], where a‘ + bf = c is a hy- 
pothetical non-trivial solution to Fermat’s Last theorem, and where we have 
supposed without loss of generality that a, b and c are pairwise coprime, that 
b is even and that a = —1 mod 4. Frey suggested that properties of this hy- 
pothetical elliptic curve Æ could lead to a contradiction if the curve were also 
known to be modular. This suggestion was made precise by Serre [Se7], who 
observed that the following list of properties would yield the desired contradic- 
tion, when combined with his conjectures on Galois representations associated 
to modular forms (to be discussed in section 3.2). 


Theorem 2.15 Suppose that L > 3 is prime. The representation pge has the 
following properties. 


(a) Pry is irreducible. 


(b) Pry is unramified outside £ and 2. 


(c) There is a finite flat group scheme F/Z, such that F(Qv) S E[é|(Qe) as 
Gy-modules. 


(d) #Ppe(l2) = £. 


Proof: Part (a) follows from Mazur’s theorem (theorem 2.9, (c)). To prove (b), 
(c), and (d), we note the key fact that A@™ = 278(abc)” is a perfect ¢-th power, 
up to powers of 2. The fact that pg, is unramified at p 4 £, 2 (part (b)) follows 
from this, from proposition 2.11, (a) (for primes p 4 £ of good reduction for Æ) 
and from proposition 2.12 (c) (for primes p 4 ¢,2 of multiplicative reduction 
for E). Part (c) is a consequence of proposition 2.12 (d), and part (d) follows 
from proposition 2.12 (c). Oo 


Remark 2.16 Part (a) implies that the image of pz, is large, while parts (b) 
and (c) state that the image has very limited ramification. We remark that 
the conclusion of the theorem ensures that m,(fz,) = 0 for p # 2, @ and that 
mo(Pry) = 1. 


2.3 Galois cohomology 


In this section M will denote a continuous discrete G'g-module of finite cardi- 
nality. If M and N are two such modules we will endow the space Hom( M, N) 
of homomorphisms of abelian groups with an action of Gg via the formula 


(go) (m) = g(¢(g-*m)). 
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We will use M* to denote Hom(M, un(Q)), where n is an integer such that 
nM = (0) and where pz,(Q) denotes the group of nt? roots of unity in Q. 
We will use V to denote Pontryagin duals. All cohomology groups of profi- 
nite groups will mean continuous cohomology. The general references for this 
section are Milne [Mi] and Serre [Sel]. 

We start by recalling Tate’s local duality theorem (see chapter 1, section 2 
of [Mi]). 


Theorem 2.17 Letv be a place of Q (i.e. either a prime number or oo). 
(a) H'*(G,,M) is finite for all i. 


(b) For all integers n there are compatible embeddings H?(Gy, Un(Q ) => 


Q/Z (i.e. compatible with the maps coming from Un(Q) = Um(Q) if 
n|m). 


(c) Forv 4 œ andi =0,1,2 the cup product and the above embeddings give 
rise to a perfect pairing 


H"(Gy, M) x H?*(G,, M*) > H? (Gu, tn(Q)) > Q/Z. 


(This also holds when v = œ, if one replaces H°(G,, M) = H° (R, M) by 
Tate’s modified cohomology group: H®(R,M) := M7™="/(1+7)M, where 
T is complex conjugation. ) 


(d) Ifv #00 is a prime, then H*(G,,M) = (0) fori > 2 and 
#H'(Gy,M) = #H°(Gy, M)#H* (Gy, M)#(M 8 Z,). 


(e) Ifv 4 œ and M 8 Z, = 0 then H1(G,/I,,M"™) and H\(G,/I,,M*”) 
are the annihilators of each other under the pairing 


H(G,, M) x H!(G,, M*) > Q/Z. 


By a collection of local conditions for M we shall mean a collection £ = 
{L,} of subgroups L, C H!(G,, M) as v runs over all places of Q, such that 
L, = H'(G,/I,, M®) for all but finitely many v. Note that if £ is a collection 
of local conditions for M then £* = {L}} is a collection of local conditions for 
M*. If £ is a collection of local conditions for M we define the corresponding 
Selmer group, H7(Q, M), to be the subgroup of x € H! (Go, M) such that for 
all places v of Q we have 


resz € Ly C H'(G,,M). 


60 


Suppose that v is a finite place of Q. We have an exact sequence 


(0) + H°(Gy,M) > Mh P25" Mh > H'(G,/Iv, M™) > (0), 
and hence we see that #H1(G,/I,, M™®) = #H°(G,, M). 

We have the following important observation of Wiles [W3], inspired by a 
formula of Greenberg [Gre]. An important theme in number theory has been 
the calculation of Selmer groups. This result, although it does not allow the 
absolute calculation of Selmer groups, allows the comparison of two (dual) 
Selmer groups. In the applications in this paper we shall apply the theorem 
where the various data have been chosen to make one of the Selmer groups 
trivial. In such a situation it allows the exact calculation of the order of the 
non-trivial Selmer group. 


Theorem 2.18 If £ is a collection of local conditions for M then the Selmer 
group HŁ(Q, M) is finite. Moreover we have the ane 


#H}(Q, M) #H° (Go, M SII 
#HŁ.(Q,M*) #H°(Go, M*) ae n 


We note that all but finitely many terms in the product are 1 so that it makes 
sense. 

Proof of theorem 2.18: Choose a finite set, S, of places of Q, which contains oo, 
all the places whose residue characteristic divides the order of M, all places 
at which M is ramified and all places at which L, # H!(G,/I„, M™). Let 
Qs denote the maximal extension of Q unramified outside S and let Gs = 
Gal (Qs/Q). Then we have an exact sequence 


(0) > HŁ(Q, M) > H' (Gs, M) + @ H(G., M)/L, 
vES 


Because H!(Gs, M) is finite (see cor. 4.15 of ch. 1 of [Mi]), we see that 
H}(Q, M) is finite. 

Dualising this sequence for M* and using theorem 2.17, we see that we 
have an exact sequence 


E Le > H! (Gs, M*)“ > H} (Q, M*)“ = (0). 


ves 


On the other hand we have the following part of the Poitou-Tate nine term 
sequence (see thm. 4.10 of ch. 1 of [Mi]): 


(0) A°(Gs,M) > (Byes H(G, M)) /(L + c)M > H*(Gs, M*)Y 
H! (Gs, M*)Y — Des H'(G», M) — H'(Gs,M), 
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where we regard (1+ c)M as contained in H°(G,,,M). Replacing H'(G,, M) 
by L, for v € S and combining this with the previous exact sequence we get 


(0) > H%Gs,M) > (Pues H(G., M)) /(1 + c)M 
Ð, g Lo S H} (Q, M) i H? (Gs, M*)* 
H!(Gs, M*)Y > HŁ. (Q, M*)¥ —> (0). 


Theorem 2.18 follows from this and the global Euler characteristic formula 
(see thm. 5.1 of ch. 1 of [Mi]}): 


#H"(Gs, M*) = (#M*) 
#H?(Gs, M*)#H2?(Gs, M*) #H°(G., M*) 


= #(1 + c)M. 


We remark, but will not use elsewhere in this article, that the above argu- 
ment works over any number field: 


Theorem 2.19 Let F be a number field and M a discrete Gr module of finite 
cardinality. For each place v of F let G, denote a decomposition group at v 
and if v is finite let I, C G, denote the inertia group. Fix subgroups Ly C 
H'(G,,M), and such that for all but finitely many v, L, = H'(G,/I,, M®). 
Let Hp(F,M) (respectively, Hp.(F,M*)) denote the inverse image of Į], Lu 
(respectively, Į], Li) under the map H!(Gr, M) — I|, H(G, M) (respec- 
tively, H'(Gr,M*) > [[, H'(G., M*)). Then HŁ(G, M) and H}.(G,M*) 
are finite and we have the formula 


#Hp.(F, M*) 3 #H° (Gr, M*) v #H(G,,M) 


2.4 Representations of Go, 


In this section we assume that £ is an odd prime. If G is any topological 
group then by a finite O|G|-module we shall mean a discrete O-module of 
finite cardinality with a continuous action of G. By a profinite O[G]-module 
we shall mean an inverse limit of finite O|G|-modules. 

If M is a profinite O[G;|-module then we will call M 


e good if for every discrete quotient M’ of M there is a finite flat group 
scheme F/Z, such that M' = F(Q,;) as Z,|G;|-modules; 
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e ordinary if there is an exact sequence 
(0) > MCD — M — MO — (0) 


of profinite O[G;|-modules such that Jy acts trivially on M and by e 
on M? (equivalently, if for all 0,7 € Ip we have (o — e(a))(r — 1) = 0 
on M); 


e semi-stable if M is either good or ordinary. 


Suppose that R is a complete Noetherian local O-algebra with residue field K. 
We will call a continuous representation p : Ge — GL2(R) good, ordinary or 
semistable, if 


det plz, = € (2.4.1) 


and if the underlying profinite O[G,|-module, M, is good, ordinary or semi- 
stable. We write p for p mod mg. We record the following consequence of 
Nakayama’s lemma. 


Lemma 2.20 If M, and p are ordinary, then ME” and Mo are each free 
of rank one over R and p is ordinary. 


Remark 2.21 (a) These definitions are somewhat ad hoc, but at the mo- 
ment that is all that seems to be available (though the work of Fontaine 
and Laffaille [FL] and its generalisations may well provide a more sys- 
tematic setting). 


(b) For part of the motivation for our definitions, see proposition 2.23 and 
remark 2.24. For further motivation, we shall see in the next chapter 
that representations arising from certain modular forms are semistable. 
Moreover the Fontaine-Mazur conjecture predicts that, conversely, any 
representation of Go with semistable restriction to Gg arises from such 
a modular form. We shall state a weak form of the Fontaine-Mazur 
conjecture below (conjecture 3.17). 


(c) The terminology in [W3] to describe representations of Ge is slightly 
different. In particular, we impose the condition (2.4.1) in our definitions 
of good and ordinary, as this is all we shall need here. Assuming p 
satisfies this condition, the notion of ordinary in [W3] coincides with the 
one here, and p is flat in the sense of [W3] if and only if it is good but not 
ordinary. (Note that a representation may be both good and ordinary, 
for instance ppg for an elliptic curve with good, ordinary reduction; see 
proposition 2.23.) 
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The following assertions, in the good case, are consequences of results of 
Raynaud [Ray] (in particular, see sec. 2.1, prop. 2.3.1 and cor. 3.3.6 of [Ray]). 
In the ordinary case they are elementary. 


Lemma 2.22 (a) Stable profinite O|G,|-modules are closed under taking 
sub-objects, quotients and direct products. The same is true for ordi- 
nary profinite O|G,|-modules. 


(b) Suppose that M is a profinite O|G:|-module and that {M;} is a family of 
sub-objects with trivial intersection, such that each M/M; is good (resp. 
ordinary). Then M is good (resp. ordinary). 


(c) If M is a finite O[G,]-module then M is good if and only if there is a 
finite flat group scheme F/Z, such that M = F(Q) as Z,[G¢|-modules. 


(d) IfM and M' are profinite O|G:|-modules with M' = M as Zelle] -modules 
then M is good (resp. ordinary) if and only if M' is good (resp. ordinary). 


(e) Suppose that M is a profinite O|G:|-module which is finite and free over 
O. Then M is good if and only if there exists an ¢-divisible group F /Z¢ 
such that M is isomorphic to the Tate module of F as a Ze|Gy|-module. 


Together with results stated in section 2.2, we have the following. 
Proposition 2.23 Suppose that E is an elliptic curve over Q and O = Ze. 


e if E has good (resp. semistable) reduction at £, then pge and pry are 
good (resp. semistable); 


e if E has semistable reduction at L, then pry is ordinary if and only if 
Pry is ordinary if and only if either E has multiplicative reduction or E 
has good ordinary reduction. 


Remark 2.24 It is also true that if pg, is good (resp. semistable), then E 
has good (resp. semistable) reduction at £, but the result is more difficult and 
we shall not need it. 


We will need a few more definitions. We will let w : Ip —> F> denote 


the character o + (ow)/w mod w where w = “70. If F is a field of 
characteristic other than £ and if M is a profinite Z,/G,|-module then we 


set M(1) = lim M @z, er (F). If p : Ge > GL2(R) is ordinary the extension 
(0) > R(1) > M, > R > (0) 
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of R|I;]-modules gives rise to a class c, € H'(Ip, R(1)). Kummer theory and 
the valuation on (Q/‘)* give rise to a map v : H(I, R(1)) > H @z, R > R, 
where H denotes the ¢-adic completion of (Q))*. Then we also have the 
following lemma. 


Lemma 2.25 (a) If p: Ge + GLa(k) is good then either p is ordinary or 
pl, 8 k = 4 Pyt. 


(b) If p: Ge > GL2(R) is such that M, is good and p is ordinary, then p is 
good and ordinary. 


(c) If p: Ge > GLa(R) is ordinary then p is good if and only if v(c,) = 0. 


Note that we need only consider the case that R has finite cardinality to 
prove the lemma. Parts (a) and (b) again follow from Raynaud’s results [Ray] 
(for part (b) consider the connected-étale sequences for M, and Mp). 

We sketch the proof of part (c) using an argument suggested to us by 
Edixhoven. As in [Edi], prop. 8.2 it suffices to determine which extensions 


(0) — R(1) — M — R — (0) (2.4.2) 


of R|I;]-modules arise from finite flat group schemes over Zọ", the ring of 
integers of the maximal unramified extension of Z. By prop. 17.4 of [Oo], 
the extension (2.4.2) arises from a finite flat group scheme if and only if it 
corresponds to an extension of sheaves of R-modules in the fpqc topology over 
Zy. Therefore we must compute the image of 


Erte noona Ph R(1)) k Ext pir) (R, R(1)). 


Since the sheaf Ext! of R by R(1) vanishes, this is equivalent to computing 
the image of 


1 
H pac 


(ZF, R(1)) > H (l, R(1)). 


Part (c) follows from the fact this is precisely the kernel of v. 


Remark 2.26 The authors expect that the theory of Fontaine and Laffaille 
[FL] discussed in the next section could be used to prove that if p : Ge —> 
G'L2(R) is such that M, is good and J is good, then det p|z, is cyclotomic, i.e., 
p is good. 


For semistable representations p : Ge —> GLa(O/à") we shall define O- 
submodules 
A; (Ge, adp) C H&(Ge, adp) C H' (Ge, adp). 
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Before doing so, let us consider more generally a continuous representation p 
of a profinite group G with values in GL4(O/A"). For each continuous cocycle 
E : G — adp, we define a representation 


pe:G— GL,(R,) 
gt (1 + €&(9))p(9) 


where R, = (O/A")[e]/(e”). We find that the map € — pẹ induces a bijec- 
tion between H'(G,adp) and equivalence classes of representations p’ : G —> 
GL2(R,) such that p = p' mod éR,, where p} and ph are deemed equivalent 
if they are conjugate by an element of 1 + ¢M2(R,,). Now let M, denote the 
(O/A")|G|-module corresponding to p, and for each continuous cocycle €, let 
Eg denote the R,,|G|-module corresponding to pe. Note that multiplication by 
e defines an isomorphism from M, = E¢/eE¢ to eE¢ of (O/X")[G]-modules. 
We thus obtain an extension 


0-M,- Es > M, > 0 


and hence a class in Ext'(M,,M,) in the category of profinite (©/A®)[G]- 
modules. Moreover if pe, and pe, are equivalent, then Ez, and E¢, define the 
same extension class. We thus obtain a map 


H'(G, adp) > Ext'(M,, Mp), 


which the reader can check is an (O/X")-linear isomorphism. Classes in 
H'(G,ad°p) correspond to the equivalence classes of representations p’ sat- 
isfying det p’ = det p, and ¢ in Hom(G, O/A”) corresponds to the twist of p by 
the character g ++ 1+e€¢(g). In the case ¢ Jd where we have the decomposition 


H! (G, adp) = H*(G,ad°p) 6 Hom(G, O/A"), 


we see that if p’ corresponds to a class in H'(G,adp), then the projection to 
Hom(G,O/X") is the homomorphism ¢ such that det p’ = (1 + edo) det p. 
We now return to the case of a semistable representation 


pP: Gy a GL2(O/A") 


and define the cohomology groups H;(G:,adp) and H} (G, adp) as follows. 
Let Hi(Ge,adp) denote the natural image in H(G, adp) of Ext'(M,, Mp) 
taken in the category of semi-stable profinite O/A"|G;|-modules. If p is not 
good then H} (G;, adp) is defined to be Hi(G,adp). If, however, p is good 
then H;(G,,adp) will denote the natural image in H'(G;, adp) of the group 
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Ext'(M,, M,) taken in the category of good profinite O/\"[G|-modules. We 
define H}(Gp,ad°p) (resp. Hi (G:,ad°p)) as the intersection of H}(G¢, adp) 
(resp. Hi (G, adp)) and H'(G,, ad°p). Note that if p is good (resp. semistable) 
and € : Gp — adp is a cocycle, then E¢ is good (resp. semistable) if and only 
if pe is. 

Suppose that p : Ge —> GL2(O/X") is ordinary. Consider the exact se- 
quence 

0- MC = M, `> MO — 0, 


where Mc denote the maximal submodule of M, where Ip acts by e. Let 
ad‘) p denote the submodule Hom(M)”, MOY) of ad°p. Then 


Hi (Go, adp) = ker (H! (Gg, adp) > H*(Ip, adp/ad~p)). 


The same is true with ad’ replacing ad. 
If p : Ge > GL2(O) is semi-stable then we set 


H; (Ge, adp & K/O) = lim H} (Ge, adp & (A-"/O)) C H! (Gy, adp @ K/O). 


We define H} (Gg, ad°p @ K/O) as 
H; (Ge, adp ® K/O) N H*(G;,ad°p 8 K/O). 


We make similar definitions for H} (Gg, adp@ K/O) and H}(G;,ad°p@ K/O). 
We will need the following calculations. 


Proposition 2.27 (a) Suppose that p: Ge > GL2(k) is semi-stable. Then 
#H; (Ge, ad?) < #H°(Ge, ad°p)#k 
and equality holds if p is ordinary’. 


(b) Suppose that p : Ge — GL2(O) is both good and ordinary. Let x, and x2 
be the unramified characters such that p ~ Gy o Let 


Ce = (x1/X2)(Frob ) mar 
If ce #0, then 
H4 (Gi, ad°p 9 K/O)/ Hi (Gi, ad°p 9 K/O) 


is finite of order #(0/&0). 
3The authors expect equality to hold without this hypothesis; cf. remark 2.26. 
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(c) If p mod J is either not good or not ordinary, then 
H2(Gy,ad°p 8 K/O) = H} (Gu, adp ® K/O). 


Proof: Part (c) is clear, and for parts (a) and (b) it suffices to prove the 
following two results. 


Proposition 2.28 Suppose that p : Ge > GL2(O/A") is good. Then 
#H; (Go, adp) = #H°(G;, ad°p)(#O/X")?. 


Lemma 2.29 Suppose that p : Ge > GL2(O/X") is ordinary, so that we have 
p~ Ce D for some unramified characters x, and xə. 


(a) We have 
#H a (Ge, ad’ p) < #H? (Ge, ad’ p)#(O/A")#(O/(A", ce)) 
where ce = (X1/X2)(Frob )—1. Moreover if p is good, then equality holds. 
(b) Ifn=1 and p is not good then #H} (Ge, ad? p) = #k. 


End of proof of proposition 2.27. To deduce proposition 2.27 (in the good case) 
from proposition 2.28 and lemma 2.29, note that if p : Ge > GL2(O/A") is 
good, then 


H! (Ge/Ie, O/X") C H} (Ge, adp) N H* (Ge, O/A") 


by lemma 2.22 (d), and this gives the inequality in proposition 2.27(a). Fur- 
thermore if p is also ordinary, then the above group is contained in 


Im (H; (G2, adp) > H*(G,,O/X")) C H' (G/L, O/A") 
where the last inclusion comes from lemma 2.25 (b). Therefore 


#H; (Ge, ad°p) = #H° (Ge, ad? p)#(0/A"). 


We will prove proposition 2.28 in the next section using the theory of 
Fontaine and Laffaille. The proof of lemma 2.29 is a somewhat technical 
exercise in the Galois cohomology of local fields, for which we refer the reader 
to ch. 1 of [W3], prop. 1.9, parts (iii) and (iv). We remark that for our 
purposes, inequalities would suffice in part (b) of proposition 2.27 and part (a) 
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of lemma 2.29 (and this is all that is proved in [W3]). We have included the 
precise formulas for the sake of completeness, since they are not much more 
difficult to obtain. The only additional observation required is that if p is 
good, then the composite of the natural maps 


A" (Ge/Le, (ad°p/aad— p)"*) — H! (Gi, ad?p/ad” p) — H? (Gr, ad) p) 
is trivial. To prove this, rewrite the composite as 
H (Gefle, O/A") > H! (Gi, O/A") > H? (Ge, ad p) 


with the second map given by Uc, where c, is the class in H (Gp, ad p) 
defining the extension M, and apply lemma 2.25 (c). (In fact, one only needs 
the easier half of lemma 2.25 (c): if p is good then vu(c,) = 0.) 


2.5 The theory of Fontaine and Laffaille 


In this section we again assume that @ is an odd prime. We mentioned in 
the last section the importance of understanding good representations of Gy. 
However the definition of good is somewhat indirect, and this makes computa- 
tions difficult. The key result we use to address the problem is an equivalence 
between the category FFo of good finite O|G:|-modules and a category MFo 
which we define below following Fontaine and Laffaille [FL]. The beauty and 
utility of the result stems from the elementary algebraic nature of the def- 
inition of MFo; we can convert questions about good representations into 
questions in linear algebra. 


Remark 2.30 Let GFo denote the category of finite flat commutative group 
schemes over Z; with an action of ©. By results of Raynaud [Ray], taking 
Q; points defines an equivalence between the categories GFo and FFo. The 
equivalence between GFo and a category closely related to MFo was first 
established by Fontaine [Fo2] and [Fol]. While Fontaine’s results would suffice 
for our purposes here, our formulation will be closer to that in [FL], where 
an equivalence between MFo and F Fo is defined as part of a more general 
construction of representations of Ge from linear-algebraic data. We caution 
however that our formulation is not exactly the same as that of [FL] since we 
wish to work with covariant functors. 


We now turn to the definition of MFo. The objects are O-modules D 
of finite cardinality together with a distinguished submodule D° and O-linear 
maps ¢_;: D — D and ġo : D? — D which satisfy: 
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e 6-1|p = C40, 
e Im @_1 -+ Im ġo = D. 
The morphisms are O-linear maps compatible with the additional data of the 


distinguished submodules and maps @. 
It is useful to note that if D is an object of MFo, then there is a surjection 


¢-1 © bo: D/D? © D? /LD° —> D/ldbo(D"), 


and on counting orders we see that this is in fact an isomorphism. Thus there 
is an isomorphism 


D/(D° + AD) @ D9/XD® > D/AD. 


It follows that D°/\D° — D/XD is injective, and hence that D? is (non- 
canonically) a direct summand of D as an O-module. Note also that @o is 
injective, and if D = D’ @ D’ as O-modules, then also D = ¢9(D°) $ d_1(D’) 
as O-modules. 


It is then straightforward to check that there is a contravariant functor -* 
from MFo to itself defined by: 


e D* = Hom(D, Qy/Z»); 

e (D*)° = Hom(D/D®, Qe/Z.); 

e $2i(f)(z) = f(€x +y), where z = ġ-1 (x) + oly); 

© ¢0(f)(z) = f(x mod D°), where z = ġ-ı(£) mod (o D°). 


Moreover the canonical isomorphism D = (D*)* of O-modules defines a natu- 
ral isomorphism in M Fo. 

We leave it as an exercise for the reader to use the above observations to 
define cokernels and then kernels of morphisms in MFo and verify that it is 
an abelian category (or see [FL], sec. 1). 


Theorem 2.31 There is a covariant functor D : FFo > MFo which defines 
an O-additive equivalence of categories. Moreover if M is an object of FFo, 
then we have 


(a) M and D(M) have the same cardinality; 
(b) D(M) = D(M)? if and only if M is unramified. 
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Remark 2.32 (a) It follows on applying part (a) to M/A'M for each i that 
M and D(M) are in fact (non-canonically) isomorphic as O-modules. 


(b) As mentioned above, our formulation differs from that of Fontaine and 
Laffaille in that we are using covariant functors. To deduce theorem 2.31 
from the results in [FL], sec. 9, we define D as a quasi-inverse of the 
functor U,(-*[1]), where U, is defined in [FL] and [1] indicates a shift 
by 1 in filtration degrees. In particular, if F is a finite flat group scheme 
over Ze with an action of O, then the underlying O-module of D(F(Q,)) 
can be identified with the covariant Dieudonné module of F/z,, and ¢_1 
with F. 


Suppose now that p : Ge — GL2(O/A") is a good continuous repre- 
sentation. Let D, = D(M,) be the corresponding object of MFo. Then 
D, = (O/X")? as an O-module while DÌ = O/A". 


Lemma 2.33 The following are isomorphic. 


(a) The group of extensions of M, by itself in the category of good finite 
O/X"|Ge|-modules. 

(b) The group of extensions of D, by itself in the full subcategory of MFo 
consisting of objects which are O/X"-modules. 

(c) Pairs (a_1,09) where a_; € Homo(D,,D,), ao E€ Home(D?, Dp) and 
lao = a|n, modulo pairs of the form (abp,-1 — %p,-14; @p,0 — bp,04| D0) 


where a € Homo(D,, Dp) and aD? C DÌ. 


Proof. The first two groups of extensions are isomorphic by the Fontaine- 
Laffaille theorem. Following Ramakrishna [Ram] we explain how to calculate 
the second group of extensions. We will write D for D,. If 


(0)—> D> E > D — (0) 


is an extension then we have as O-modules that E & D? by an isomorphism 
such that E° > (D°)?. Then E is determined by giving elements ag; € 
Hom(D, D) and ago € Hom(D°, D) with lago = ag-1|po. Explicitly 


QE, -1 = Gs a) and = ¢go0 = Ge ore) 


0 


Two such extensions Æ and E’ are isomorphic if there is an element a € 
End (D) such that aD? C D?, 


A a (ot): 
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and 
(02) (PR) = (BR) OP) 


or equivalently if aD? C D?, ag -1 — ap -1 = ¢-1a— aġ—ı and apy — QE o = 
poa|po — ago. The lemma follows. 


Corollary 2.34 The group, H}(Ge, adp), of extensions of M, by itself in the 
category of good, finite O/A" [Gı]-modules is (non-canonically) isomorphic to 
(O/X")? ® H? (Ie, Mp). 


Proof. Choose generators eo, e—1 of D, such that eo is a generator of D°. With 


respect to this basis let @9 have matrix ( Z) and ¢_; have matrix (z A If 


a has matrix (3 =) then a direct calculation shows that 
(ag_1 — b-14, agp — oa|po) = 
faz  (aı— a3)y + ao(w — £x) ( a22 ) 
(a3 — alz —lza2 »\ (a3 —ai)z) J 


Thus the Ext group we want is the quotient of the set of pairs of matrices 
(@_1,Q9) as above by the submodule generated by 


lz w — bx Z 0 —y 0 
(E7) (6) ama (Cee?) (2): 
Note that either z or w is a unit in O/A from which it follows that the Ext 
group we want is isomorphic to (O/XA")? 6 O/(z, A"). On the other hand 


H? (Ie, M,) corresponds to the largest submodule C C D° such that ¢oC = C, 
ie. C= {de D? : zd = 0}  O/(z, A”). 


Corollary 2.35 Suppose that p : Ge > GLI2(O/A") is a continuous good 
representation. Then H}(G,adp) is isomorphic to (O/X")? 6 H?(G;, ad? p). 


Proof: If p is not ordinary then H°(G,,ad°p) and H°(Ip, M,) are both triv- 


ial, so suppose that p is ordinary. In this case, let p’ denote the ordinary 
representation defined by 


My = Hom(M", M,,) C ad°(p) 
and let Mo» =ad°(p)(-). Then 


H? (Ge, ad? p) = H? (Ge, Mp) C H°(Ip, My) S H? (Ie, Mp). 
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Since H°(Ip, MS H is trivial, the restriction homomorphism 
—1 =i 
HG, MO”) = HM) 
is injective. We deduce from the long exact sequences associated to 


(0) > Mo > My => O/A" > 0 


that H°(Go, Mg) = H? (Ip, My). 


2.6 Deformations of representations 


In this section we shall review Mazur’s theory of deformations of representa- 
tions of profinite groups (see [Maz3]). 

Let Co denote the category whose objects are complete noetherian local 
O-algebras with residue field k and whose morphisms are O-algebra homomor- 
phisms which are local (i.e. take maximal ideals into maximal ideals). (The 
structure maps from O to every object of Co are also assumed to be local.) Let 
G denote a topologically finitely generated profinite group and let p denote an 
absolutely irreducible representation of G into GLa(k). (In fact all we shall 
use in the sequel is that k is the centraliser in Ma(k) of the image of p.) Let 
Do denote the category of profinite O|G|-modules with continuous morphisms. 
We will let D denote a full subcategory of Dp which is closed under taking sub- 
objects, quotients and direct products and which contains M;. Note that if 
M is an object of Dp and M; is a collection of subobjects which have trivial 
intersection and such that each M/M; is an object of D, then M is an object 
of D, since M C ] |; M/Mi. 

Let y : G — O* be a continuous character such that det p = x mod A. By 
a lifting of p of type D = (O, x, D) we shall mean an object R of Co and a 
continuous representation p : G — GLalR) such that: 


(a) p mod mp = f, 
(b) det p = x, 
(c) M, is an object of D. 
Note that if ọ : R — R’ and p : G > GL,(R) is of type D so is go p. 


Theorem 2.36 There is a lifting p¥'Y : G — GLal(Rp) of p of type D such 
that if p : G — GLalR) is any lifting of p of type D then there is a unique 
homomorphism of O-algebras 6: Rp —> R such that p is conjugate to do py’. 
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The representation p¥5"" is referred to as the universal deformation of type 
D. Mazur [Maz3] proved this theorem for Do and certain other categories D. 
Ramakrishna [Ram] observed that the arguments work with any category D 
satisfying the above hypotheses. We will sketch a proof which was suggested 
by Faltings. (We remark that another explicit construction of the deformation 
ring has been given by de Smit and Lenstra in [dSL].) 

Proof of theorem 2.36: Choose a sequence gj,...,g, of topological generators 
of G and liftings Aj,...,A, of A(g1),..., (gr) to Ma(O). Define a mapping 
tu: Ma(O) — Ma(O)” which sends x to (Ai — Aiz,...,.cA, — Arr). Since + 


has torsion-free cokernel, so we can decompose 
Ma(O)" = e(Ma(O)) ® V, 


for some submodule V C M,(O)". If p : G > GLa(R) is a lifting of p of type 
D set v, = (p(g1) — Ai, ---, P(gr) — Ar) E€ Mal R)”. Note that v, = 0 mod mr 
and that v, completely determines p. We call the lifting p well-placed if v, 
belongs to V &o R C Ma( R)". The crucial observation is the following result. 


Lemma 2.37 If p: G — GL,(R) is a lifting of p of type D then there is a 
unique conjugate p' of p which is well-placed. 


The lemma is first proved for algebras R such that m% = (0) by induction on 
n, and then one deduces the general case. 

In virtue of the lemma it suffices to find a universal well-placed lifting of 
type D. Let e1, ..., €s be a basis of V as an O-module. If p is a well-placed lifting 
of p of type D then we can write v, = Bee Up,ie€i for unique elements vp; E MR 
and we can define a homomorphism 6, : O|[T},...,7s]] —> R sending T; to vpi. 
Note that p is completely determined by 4, (p(gi) = Ait > 05—1 0p(Tj)eji, where 
ej = (€j1,---,ejr)). Let J denote the intersection of all ideals J of O[[T}, ..., Tal] 
such that there is a representation pj : Œ > GL,(O[[T),...,T5]|/J) of type 
D with ps(gi) = Ai + 05-1 Tjeji for all i. Let Rp denote the quotient of 
O[[T,, ..., Ts]] by Z. Then one can check that there is a representation p"™” : 
G > GLa(Rp) with p™™(g:) = Ai + X 5-1 Tjej; for all i, and that this is the 
desired universal representation. o 

We will need a few elementary properties of these universal deformations. 
More precisely we will need to know how these universal rings change when 
we change the base field, we will need to know how to calculate the equi- 
characteristic tangent space of these rings and more generally how to calculate 
o/g? for certain prime ideals p. The first of these lemmas is a remark of 
Faltings, the second is due to Mazur [Maz3] and the third to Wiles [W3]. 
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Lemma 2.38 Let K'/K be a finite extension with ring of integers O' and 
residue field k'. Let D’ denote the full sub-category of the category of profinite 
O'|G]-modules such that the underlying object of Do is actually an object of D. 
Let D' = (O'",x,D’). Then Rp: = Rp 80 O' and ps’ = pH" @1. 


Proof. Let Rp denote the subring of Rp consisting of elements which reduce 
modulo the maximal ideal to an element of k C k’. Similarly let Rp denote the 
subring of Rp 8o O’ consisting of elements which reduce modulo the maximal 
ideal to an element of k C k’. Then p% is in fact valued in GL4(Rp:) and 
piv @ 1 is in fact valued in GLg(Rp). The universal properties give natural 
maps a: Rp > Bp and 3: Rp — Rp 80o O'. Moreover they show that the 
composite (a ®1)08: Rp — Rp, is the identity and that Goa: Rp > Rp 
is the natural embedding. Thus ( is an isomorphism. oO 

We will let D™ denote the full subcategory of D whose objects are killed 
by A”. Suppose that M is an object of Dp which is finite and free over 
O/X". Recall from section 2.4 that H'(G,End(M)) may be identified with 
Extn (M, M). If M is an object of D™ which is finite and free over O/A", 


then we have a natural inclusion 


Extpm(M, M) C Ext! (M, M) S H'(G, End (M)). 


(n) 
Do 


We define H}(G,End(M)) to be the image of Exthm (M, M) in the group 
H'(G,End(M)), and H4(G,End°(M)) to be the intersection 


H! (G, End °(M)) N H} (G, End(M)). 
Lemma 2.39 There is a canonical isomorphism of k-vector spaces 
Hom; (mrp /(A, MRp) k) = H5(G, adp). 


Proof. There is a natural bijection between Hom; (mg, /(A, m}, ), Æ) and the 
set of O-algebra homomorphisms from Rp to the algebra ke] where £? = 0 (the 
correspondence associates ¢: Rp — kļ|e] to mn, )- Hence there is a bijection 
to the set of liftings p : G — GLa(kle]) of p of type D, modulo conjugation by 
elements of 1+ ¢M,(k). On the other hand, recall from section 2.4 that there 
is a natural bijection between Ext o) (Mp, Mp5) and the set of of all continuous 
liftings p : G — GL, (kle]) of p modulo conjugation by elements of 1+¢M,(k). 
Moreover a lifting p is type D if and only if det p = det p and the corresponding 
extension M, is an object of D. The lemma follows on checking linearity. 

Now suppose that 0 : Rp — O is an O-algebra homomorphism. Let ø 
denote the kernel of 6 and let p = 6 0 py". We set 


H}(G,ad°p & K/O) = lim H} (G, ad°p & \-"/O) c H'(G,ad°9 @ K/O). 
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Lemma 2.40 There is a canonical O-linear isomorphism 
Homo(9/*, K/O) = Hp(G,ad°p 8 K/O). 


Proof: One shows in a very similar manner to the proof of lemma 2.39 that 
for all n there is a natural isomorphism. 


Homo (9/9, O/A") = Hp(G, ad°p 8 O/A"). 


2.7 Deformations of Galois representations 


Again in this section we assume that £ is an odd prime. Let J : Gg —> GLa(k) 
denote a continuous absolutely irreducible representation. Suppose moreover 
that det p = e and that J is semi-stable in the sense that 


e plc, is semi-stable, 


e and if p £ £ then #p(J,)|é. 


Note that if E/Q is a semistable elliptic curve then pry : Gg > GLo(Fe), sat- 
isfies these conditions if it is irreducible. By a theorem of Mazur (theorem 2.9) 
this will be the case if £ > 7. 

Let X denote a finite set of prime numbers. If R is an object of Co then 
we say that a continuous lifting p : Gg —> GL2(R) of p is of type X if the 
following hold. 


e det p =e. 

e plc, is semi-stable. 

e If LZ X and pe, is good then pg, is good. 

e Ifp¢g XU {¢} and Jis unramified at p then p is unramified at p. 


o Ifp¢g XU {é} then plr, ~ (i) 


Roughly speaking we require that at primes p ¢ X, p is as unramified as could 
be hoped for and we require that p|g, is semi-stable. Note that if © c X’ and 
p is a lifting of type È then it is also a lifting of type X’. Note also that if E /Q 
is an elliptic curve which is semi-stable at £ and for which Jg is irreducible 
and semi-stable, then pge: Gg > GLo(Zp) is a lifting of type X if X contains 
all the primes for which E has bad reduction. 

Now suppose that p : Gg > GL2(O/A") is a lifting of p of type ©. We will 
write H}(Q, ad°p) for H} (Q, ad°p), where 
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( 
e Ls.» = H!(G,, (ad°p)) if p € © and pF &; 
e Lye = H} (Ge, (ad?p)) if £ g J: 
e Lye = HL (G, (ad°p)) if lE E. 


Note that the pairing ad°p x ad°p > O/X" given by (a,b) > tr (ab) is perfect 
and respects the action of Gig (i.e. tr ((ad°p)(g)a, (ad°p)(g)b) = tr (ab) for all 
g € Gog). Choosing a generator for the O-module Hom(QO/A", Qe/Ze), we 
obtain an isomorphism of O[Gg|-modules 


ad°p(1) = Home(ad°p, O/A")(1) > Hom(ad°p, Qe/Ze)(1) = (ad? p)*. 


The O-submodule of H!(Q,ad°p(1)) corresponding to Hz. (Q, (ad°p)*) is in- 
dependent of the choice of generator and we denote it H4(Q,ad°p(1)). Thus 
H}(Q,ad"p(1)) is defined by the local conditions {L£} where the orthogo- 
nality is with respect to the pairing 


H'(G,,ad"p) x H'(Gy, ad°p(1)) > Qe/Ze 


arising from the above isomorphism. We may equivalently regard the orthog- 
onality as being with respect to the natural perfect O-bilinear pairing defined 
by the composition 


H'(Gy, ad°p) x H*(Gy,ad°p(1)) > H?(Gy, ad°p @o ad’p(1)) 
> HG, OX) YO/M. 


Note that if p 4 @ we have that 
S LS. n I (Gp/Tp, (ad°p)(1)/”) if p ¢ X; 


e Ls, = (0) ifpeX. 

If p : Go > GL2(O) is a lifting of J of type £, we will write H4(Q,ad°p & 
K/O) for the direct limit lim H4(Q, ad°p@A~"/O), and H4(Q, ad°p(1)@K/O) 
for the direct limit lim H$(Q, ad°p(1) & A~"/O). 

The main result we need about deformations of J is the following. 
Theorem 2.41 There is a universal lifting p8'Y : Gg —> GLo(Rs) of p of 
type ©, i.e. pœ is a lifting of type X and if p : Gg > GL2(R) is any lifting 


of type & then there is a unique O-algebra homomorphism @: Ry — R such 
that p ~ po px’. Moreover we have the following. 


TT 


(a) If K'/K is a finite extension and RS is the corresponding deformation 
ring then RS = Ry 80 O' and (PY = pH’ @ 1. 


(b) Rs can be topologically generated as an O-algebra by dim; H4(Q, ad°p) 
elements. 


(c) If @: Ry — O is a O-algebra homomorphism, if p = @° py and if 
p =ker@ then Hom(g/p?, K/O) X H4(Q,ad°p & K/O). 


Proof: Let Lo denote the fixed field of p. Let Ln (for n € Zs) denote the 
maximal elementary abelian (extension of Ln-ı unramified outside X, {4} 
and the primes where p ramifies. Let Lao = U,, Ln and let G = Gal (La /Q). 
Note that any lifting of p of type ÈX factors through Œ. Gal (L/L) is a pro- 
é-group and its maximal elementary abelian quotient, Gal (L1/Lo), is finite 
by theorem 2.2. We deduce from the following lemma that Gal (ZL./Zo) and 
hence G are topologically finitely generated. (See for instance [Koc] Satz 4.10 
for a proof of this lemma.) 


Lemma 2.42 Let H be a pro-¢-group and H its maximal elementary abelian 
quotient. Suppose hı, ..., h, E H map to a set of generators of H, then hy,..., hy 
topologically generate H. 


Let D denote the category of profinite O|G|-modules M for which 
e M is semi-stable as an O|G;|-module, 
e if L ¢»> and if J is good then M is good as an O[G;|-module, 


e ifp g NU {¢} and if J is ramified at p then there is an exact sequence of 
O|I,|-modules 
(0) — MY) = M = M© — (0), 


such that J, acts trivially on MT) and M®. 
Then we see that a lifting p : Gg — GL»2(R) of J is of type ÈX if and only if 
e p factors through G, 
e det p =e, 


e M, is an object of D. 
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The existence part of theorem 2.41 now follows from theorem 2.36 and part (a) 
follows from lemma 2.38. Parts (b) and (c) follow from lemmas 2.39 and 2.40, 
and the following observation: 

If p : Gg > GL2(O/A”) is a lifting of type X and if p g X U {I} is a prime 
where p ramifies then 


ker (H* (Gp, ad°p) > H' (Ip, ad°p/(ad°p)”)) = H*(Gp/ Ip; (a° p)” ). 
(2.7.1) 


Equation (2.7.1) follows from the fact that the natural map H'(J,,ad°p) > 
H'*(I,,ad°p/(ad°p)!”) is an injection (in fact an isomorphism). It is nothing 
other than the map (ad°p);, > (ad°p/(ad°p)’”);,.. This completes the proof 
of theorem 2.41. 


Corollary 2.43 Suppose that if L = 3 then Pls =x is absolutely irreducible. 
Then Ry can be topologically generated as an O-algebra by 


dim, Hy(Q, ad°p(1)) +de+ X` dim, H°(Q,,ad°A(1)) 
pEex—{e} 
elements, where de = dim, H} (Q;, ad°p) — dim, H}(Qy,ad°p) if £ € £, while 
de =0 iflg. 


Proof: Note that H°(Q,ad°p(1)) = 0 unless £ = 3 and lg, — is not abso- 
Q(v=3) 


lutely irreducible. Thus according to theorem 2.18, dim, H4(Q, adJ) is the 
sum of the following terms. 


e dim, H4(Q, ad°A(1)). 


e dim, H}(Q;, ad°p)—dim, H? (Q;, ad°p)—dim,; H°(R, ad°p) < 0 by propo- 
sition 2.27. 


© dy. 


e For each p € © — {1}, dim, H!(Q,, ad’) — dim, H? (Qp, adp) = 
dim, H°(Q,,ad°p(1)) by the local Euler characteristic formula (see the- 
orem 2.17). 
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2.8 Special cases 


In this section we will restrict attention to the deformation problems associated 
with sets Q of certain special primes g. These sets Q are chosen so that the 
associated deformation ring Rg enjoys a number of special properties which 
will be crucial in the sequel. In particular we will only consider sets Q of 
primes q which satisfy the following two properties 


e q = 1 mod £, 
e fis unramified at q and p(Frob,) has distinct k-rational eigenvalues. 


For such primes q we will let A, denote the maximal quotient of (Z/qZ)”* 
of l-power order. Then A, is naturally a quotient of both G} and Gg via the 
map 

X4 : Go — Gal (Q(¢q)/Q) = (Z/qZ)* — Ag. 
We will let Ag = Į [leo Aa XQ = Ilco Xa : Go —> Aq and ag denote the 
augmentation ideal of O[Ag]. Note that there is an isomorphism O[Ag] = 
O[[S, : q € QIj//( + $,)*#4¢ — 1 : q € Q) under which ag corresponds to 
(Sq: ¢ EQ). 

For each q € Q, we choose an eigenvalue a, of p(Frob,) and denote the 
other 64- 


Lemma 2.44 [fq E€ Q then pèle, is conjugate to « zi) for some char- 
acter € such that E(Frob ;) = dq. 


Proof: Choose a lifting f € G, of Frob}. As p(f) has distinct k-rational 
eigenvalues it is a simple application of Hensel’s lemma to see that we can 


į Q Z ~ 
choose a basis such that pg"’(f) = o B where @, and 4 reduce to ag 
q 


univ 


and 3, in k. It suffices to show that for any o € I, pg" (0) is diagonal in this 
basis. Suppose pg"’(o) = 12 + © t) with a,b,c,d € Mro- Because py" is 
tamely ramified at q we see that p (PPE (opgi (A) = pa (0). Thus 
(Gq/8, — q)b and (8,/G_ — q)c lie in Mpro(b, c) so that (b,c) = mr,(b,c) and 
(by Nakayama’s lemma) b = c = 0. m 

We let €q denote the character € : G, — Rg in the conclusion of the 
lemma. The restriction of ég, to I, factors through xq. We let €g denote the 
character Go — Rg which is unramified outside the primes of Q and whose 
restriction to I, is 4g for each q in Q. Thus €g factors through yg. We wish 
to regard Rg as an O[Ag]-algebra, and it will be most convenient to do so 
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via the map which gives rise to oa Note the following consequence of the 
lemma, 2.44. 


Corollary 2.45 The natural map Ro — Rg gives rise to an isomorphism 
Rg/agRae 5 Ro. 


Lemma 2.46 (a) Ifq € Q then H°(F,,ad°p) = H°(F,,ad°p(1)) = k and 
H'!( Fg, ad”) a H Fg, ad”p(1)) =k. 


(b) Ro can be topologically generated as an O-algebra by 
#Q + dim, H6 (Q, ad°p(1)) 
elements. 


(c) If 


Hy(Q, ad°p(1)) > EB H (Fy, ad°p(1)) 
qEQ 
then #Q = dim, H} (Q, ad°2(1)) and Ro can be topologically generated 
as an O-algebra by #Q elements. 


Proof: The first part is a direct calculation using the fact that Frob, acts 
semi-simply on ad? with eigenvalues x, 1,x7} for some x € k\{0,1}. The 
same is true for ad°p(1) as q = 1 mod £. The second and third parts follow 
from this and corollary 2.43. o 

For the proof of the last theorem of the chapter, we shall need two results 
on finite groups. 


Theorem 2.47 (a) If H is a finite subgroup of PGL2(C) then H is iso- 
morphic to one of the following groups: the cyclic group Cn of order n 
(n € Zs), the dihedral group Don of order 2n (n € Z>1), Ag, S4 or As. 


(b) If H is a finite subgroup of PGLa(F¢) then one of the following holds: 


e H is conjugate to a subgroup of the upper triangular matrices; 
e H is conjugate to PSL2(Fer) or PGL2(Fpr) for some r € Zyo; 


e H is isomorphic to A4, S4, As or the dihedral group Də of order 
2r for some r € Zs, not divisible by £. 


In fact we shall only need part (b) which is due to Dickson [Dic2], secs. 255, 
260 (see also [Hu] II.8.27), but we have included part (a) for later reference. 
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Lemma 2.48 Let F be a finite field of odd characteristic £. If ##F 4 5, then 


02) = 0. 


Proof. This is a special case of results of [CPS], table 4.5 (assuming #F # 3). 
In fact we shall only need it in the case 0 = 3, but the proof in the general 
case is no more difficult, and we sketch it here for the reader’s convenience. 

We let B (resp. U) denote the group of upper-triangular (resp. unipotent) 
matrices in Œ = SL2(F). Since @ does not divide the index of B in G, the 
restriction homomorphism 


= 


H'(SL2(F), End °( 


H'(G,End°(F’)) > H'(B, End °(F?)) 


is injective, so it suffices to prove the latter group vanishes. Since @ does not 
divide the index of U in B, we have 


H'(B,M) = H*(U, uy” 


all integers i > 0 and F[B]-modules M. If #F = 3, then one checks directly 
that for M = End °(F3), 


H+ (U, M) & ker N/(o —1)M =0 


where o generates U and N = 1+0 +0? on M. If #F > 5, then one proceeds 
by writing 


(0) = Mo C Mı C Mp C M; = End? (F°) 


as F|B]-modules where the successive quotients M;/M;_1, (for i = 1, 2, 3) are 
one-dimensional over F. The calculation is then straightforward using long 
exact sequences of cohomology, except in the case #F = 9 where one must 
also check that the one-dimensional space of classes in H!(U, M3/M3) fixed by 
B/U maps injectively to H?(U,M2/M,) via the connecting homomorphism. 
O 


Theorem 2.49 Keep the assumptions of the last section and suppose more- 
over that if L = Q(/(-l)“Y/2) then Pla, is absolutely irreducible. Then 
there exists a non-negative integer r such that for any n E€ Zyo we can find a 
finite set of primes Qn with the following properties. 


(a) Ifq € Qn then q = 1 mod 8. 
(b) Ifq € Qn then p is unramified at q and p(Frob ,) has distinct eigenvalues. 
(c) H#OQn =, 
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(d) Ro, can be topologically generated by r elements as a O-algebra. 


Proof, Take r = dim, Hj(Q,ad°(1)). It suffices to find a set Q, with the 
following properties. 


(a) If q E€ Qn then q = 1 mod &. 
(b) If q € Qn then f is unramified at q and p(Frob ,) has distinct eigenvalues. 
(c) Hy(Q,ad"A(1)) + Bye, H (Fa, ad’a(1)). 


As each H'(F,,ad°p(1)) is one dimensional we may replace the last condition 
with 


Hy(Q,ad°p(1)) > GB A (F,, ad°p(1)). 
qEQn 
Thus what we need show is that for each non-zero class [y] € Hj (Q, ad°A(1)) 
there is a prime q (depending on [7)]) such that 


(a) q= 1 mod ?", 


(b) Jis unramified at q and p(Frob ,) has distinct eigenvalues, 


(c) res,[] € H'(IF,,ad°a(1)) is nontrivial. 


Using the Chebotarev density theorem we see that it will do to find o € Gg 
such that 


(a) Clam) =1, 


(b) ad°p(c) has an eigenvalue other than 1, 


(c) (o) g (a — 1)ad"p(1). 


For m > 0, let Fm denote the extension of Q(Cgm) cut out by ad°p; i.e., 
the field fixed by the kernel of the representation ad°f restricted to GQ(em): 
We will show that ~(Gzp,) is non-trivial. For this it suffices to prove that 
H'(Gal (F,/Q), ad°p(1)) = (0). Consider the inflation-restriction exact se- 
quence 


(0) + H*(Gal (Fo/Q), (ad°A(1))°™) > H' (Gal (F,/Q), ad°A(1)) 
— H(Gal (F,,/ Fo), ad°p(1))°e. 
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Since F/Fo is an extension of degree prime to £, and since Gg acts trivially 
on Gal (F;,/F), we have: 


H'(Gal (F,,/Fo), ad°a(1))°2 ~ Hom(Gal (F,,/F,), ad°p(1)°2). 
(2.8.1) 


Since plc, is absolutely irreducible, the cohomology group in equation (2.8.1) 
vanishes. On the other hand, Gpr, acts trivially on adJ so the first term van- 
ishes as well unless Gal (Fọ/Q) has order divisible by @ and has Gal (Q(¢2)/Q) 
as a quotient. Recall that Gal (Fo/Q) is isomorphic to the projective image of 
p, so by theorem 2.47 we are reduced to the case £ = 3 and the map 


Gal (Fo/Q(¢3)) > PSL2(k) 


has image conjugate to PS'L2(F3-) for some r > 1. It suffices to prove that in 
this case 


H’ (Gal (Fo/Q(¢s)), ad" ®x k)) = (0), 


but this follows directly from lemma 2.48. 

Now note that since f|g, is absolutely irreducible, so is Pleg ee Regarding 
w(Gp,) as a Gal(F,,/Q(Cm))-submodule of ad°(p), we find that some g € 
Gal (Fa/Q(Çen)) of order not dividing £ fixes a non-zero element of y(G p, ). 
Let co be a lifting of g to Gaye). We will look for o = Too with T € Gp,. We 
only need choose 7 so that Y(Too) = Y(T) + (a0) Z (oo — 1)ad®A(1). This is 
possible because Y(Gp) É (g — 1)ad°p(1). 


3 Modular forms and Galois representations 


3.1 From modular forms to Galois representations 


We suppose in this section that f = $` an(f)q” is a newform of weight two and 
level Ny (see definition 1.21). Let K; denote the number field in C generated 
by the Fourier coefficients a,(f). Let Ww, denote the character of f, i.e., the 
homomorphism (Z/N;Z)* — K; defined by mapping d to the eigenvalue of 
(d) on f. 

Recall that a construction of Shimura (section 1.7) associates to f an 
abelian variety Aş of dimension [Ky : Q]. This abelian variety is a certain 
quotient of J;(Vy), and the action of the Hecke algebra on J;(Ny) provides an 
embedding 

K; => Endo(Ap) & Q. 
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We saw also that for each prime £ the Tate module Te(Af) ®z, Qe becomes a 
free module of rank two over K; ® Q; (lemma 1.48). The action of the Galois 
group Gg on the Tate module commutes with that of Ky, so that a choice of 
basis for the Tate module provides a representation 


As K; ® Q can be identified with the product of the completions of Ky at its 
primes over 4, we obtain from f certain 2-dimensional ¢-adic representations 


of Go. 


(-adic representations: In this discussion, we fix a prime ¢ and a finite 
extension K of Q;. We let O denote the ring of integers of K, A the maximal 
ideal and k the residue field. We shall consider -adic representations with 
coefficients in finite extensions of our fixed field K. We regard K as a subfield 
of Q; and fix embeddings Q — Q; and Q — C. If K’ is a finite extension of 
K with ring of integers O’, then we say that an ¢-adic representation Gy —> 
GL2(K’) is good (respectively, ordinary, semistable) if it is conjugate over 
K’ to a representation Ge —> GL»2(O’) which is good (respectively, ordinary, 
semistable) in the sense of section 2.4. 

Let K f denote the K-algebra in Q; generated by the Fourier coefficients of 
f. Thus K f is a finite extension of K, and it contains the completion of Kẹ at 
the prime over ¢ determined by our choice of embeddings. We let O; denote 
the ring of integers of A} and write ky for its residue field. We define 


ps: Go > GLo(K%) 


as the pushforward of (3.1.1) by the natural map Ks ® Qe > K}. We assume 
the basis is chosen so that pp factors through G'L2(O%). We also let Y, denote 
the finite order ¢-adic character 


Go — Gal (Q(¢w,)/Q > (Kp) 


obtained from w. 

The following theorem lists several fundamental properties of the ¢-adic 
representations pf obtained from Shimura’s construction. The result is a com- 
bination of the work of many mathematicians. We discuss some of the proofs 
and provide references below. In the statement we fix f as above and write 
simply N, an, p, Y, W and K' for Ny, an(f), Pr, Wr, Vy and K; respectively. 


Theorem 3.1 The ¢-adic representation 
Pp: Go =R GL(K') 


has the following properties. 
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(a) If pXN@ then p is unramified at p and p(Frob ») has characteristic poly- 
nomial 
X? — aX + py(p). 


(b) det(p) is the product of y' with the ¢-adic cyclotomic character €, and 
p(c) is conjugate to (ò 1) 


(c) p is absolutely irreducible. 
(d) The conductor N (p) is the prime-to-l-part of N. 


(e) Suppose that p # £ and p||N. Let x denote the unramified character 
Gp —> (K')* satisfying x(Frob p) = ap. If p does not divide the conductor 
of p, then pla, is of the form 


€ * 
(0x): 
If p divides the conductor of Y, then p\q, is of the form 
x ‘eva, p X: 


(f) IFLI2N, then p|G, is good. Moreover p|, is ordinary if and only if ag is 
a unit in the ring of integers of K', in which case pr, (Frob ¢) is the unit 
root of the polynomial X? — aX + ly(é). 


(g) If £ is odd and £||N, but the conductor of y is not divisible by £, then 
pla, is ordinary and pr (Frob ¢) = ap. 


Proof: Part (a) was established by Shimura ([Shi2], [Shi3]). The key ingredi- 
ent is the Eichler-Shimura congruence relation, theorem 1.29. Recall that 
Jı(N) has good reduction at primes p not dividing N. So the action of 
Gp on (Af) z, Qe is unramified and is in fact described by the action of 
Frob, € Gr, on the Tate module of the reduction. But this is given by the 
Frobenius endomorphism F whose characteristic polynomial is computed in 
corollary 1.41. 

The first assertion of (b) follows from (a) on applying the Chebotarev 
density theorem. The second assertion then follows on noting that w(—1) = 1. 

Part (c) was proved by Ribet (see section 2 of [R3]). Assuming reducibility 
of the representation, he applies algebraicity results of Lang and Serre to 
obtain a contradiction to the estimate on the Fourier coefficients stated in 
theorem 1.24. 
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Parts (d) and (e) follow from a deep result of Carayol [Cal], Thm. (A), 
building on the work of Langlands [L11], Deligne and others. In fact, this 
result and the local Langlands correspondence characterize p|g, in terms of 
wlq, and the L- and e-factors at p of twists of f. The descriptions in the case 
of p||N are based on the analysis of Deligne-Rapoport of the reduction mod p 
of Jı(N) (see [DR], [L11]). 

The first assertion of (f) follows from the fact that A, has good reduction 
at l if £ does not divide N. The second assertion of (f) (respectively, all of (g)) 
follows from the Eichler-Shimura congruence relation (respectively, the work of 
Deligne-Rapoport), and general results on ¢-divisible groups and the reduction 
of abelian varieties; see thm. 2.2 of [W2], lemma 2.1.5 of [W1], §12 of [Gro 
and thms. 2.5 and 2.6 of [Edi]. The restriction to odd £ in (f) and (g) is made 
primarily out of lack of suitable definitions. 


Mod ¢ representations: We maintain the notation used in the discussion of 
¢-adic representations. Define 


Pr à Go = GLoalkp) 


to be the semi-simplification of the reduction of pz. (See the discussion fol- 
lowing proposition 2.6.) Assertions analogous to those in theorem 3.1 hold for 
P = pry, except that 


e The representation need not be absolutely irreducible (as in (c)). How- 
ever if l is odd, one checks using (b) that p is irreducible if and only if 
it is absolutely irreducible. 


e In (d) one only has divisibility of the prime-to-¢ part of Ny by N(p) in 
general. 


The various possibilities for m,(p) to be strictly less than the exponent of 
pin N (where p ¥ £) were classified independently by Carayol [Ca2] and Livné 
[Liv]. We record the following consequence of their results (cf. the introduction 
of [DT1}): 


Proposition 3.2 Suppose that p is a prime such that p|N;, p # 1 mod £ and 
pr is unramified at p. Then tr (ps(Frob,))? = (p+ 1)? in ky. 


Artin representations: The theory of Hecke operators and newforms (see 
section 1.3) extends to modular forms on T(N) of arbitrary weight. The 
construction of (-adic representations associated to newforms was generalized 
to weight greater than 1 by Deligne [De] using étale cohomology. There are 
also Galois representations associated to newforms of weight 1 by Deligne and 
Serre [DS], but an essential difference is that these are Artin representations. 
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Theorem 3.3 If g = J an(g)q” is a newform of weight one, level N, and 
character p4, then there is an irreducible Artin representation 


py: Gg —+ GLa(C) 


of conductor N, with the following property: If p{Ng, then the characteristic 
polynomial of pg(Frob p) is 


X? — ay(g)X + Yg(p). 


Remark 3.4 Note that det(p,) is the character of Gg corresponding to y% and 
Pg(c) is conjugate to (à a) (see theorem 3.1). 


Remark 3.5 A basis can be chosen so that the representation p, takes values 
in GL2(K,) (where K, is the number field generated by the a,(g)). Moreover 
suppose that K is a finite extension of Q, in Q; and we have fixed embeddings 
of Q in C and Q,). If K, is contained in K, then we can view p, as giving rise 
to an -adic representation Go — GL2(K) and hence a mod £ representation 
Go = GLə(k). 


Remark 3.6 A key idea in the construction of py is to first construct the mod 
l representations using those already associated to newforms of higher weight. 
More precisely, suppose that Kọ — K as in remark 3.5. One can show that 
for some newform f of weight 2 and level Ny dividing N£ we have 


ar(g) = alf),  YValp)) = pve (p) 


for all p/N2@, the congruence being modulo the maximal ideal of the ring 
of integers of A}. Thus py is the semisimplification of the desired mod £ 
representation (with scalars extended to ky). 


3.2 From Galois representations to modular forms 


It is conjectured that certain types of two-dimensional representations of Gg 
always arise from the constructions described in section 3.1. We now state 
some of the conjectures and the results known prior to [W3] and [TW]. 


Artin representations: 


Conjecture 3.7 Let p : Gg —> GL2(C) be a continuous irreducible represen- 
tation with det(p(c)) = —1. Then p is equivalent to pg for some newform g of 
weight one. 
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Recall that p, is the Artin representation associated to g by the Deligne-Serre 
construction (theorem 3.3). 


Remark 3.8 Conjecture 3.7 is equivalent to the statement that the Artin 
L-functions attached to p and to all its twists by one-dimensional characters 
are entire. (The Artin conjecture predicts that the Artin L-function L(s, p) is 
entire, for an arbitrary irreducible, non-trivial Artin representation p : Go —> 


GLa(C).) 


A large part of conjecture 3.7 was proved by Langlands in [L12], and the 
results were extended by Tunnell [Tu]. 


Theorem 3.9 Let p : Gg — GL2(C) be a continuous irreducible representa- 
tion such that p(Gg) is solvable and det(p(c)) = —1. Then p is equivalent to 
Pq for some newform g of weight one. 


Remark 3.10 Note that by theorem 2.47, part (a), the solvability hypothesis 
excludes only the case where the projective image of p is isomorphic to As. 


Remark 3.11 If the projective image of p is dihedral, then p is induced from 
a character of a quadratic extension of Q. In this case the result can already 
be deduced from work of Hecke. 


Mod ¢ representations: We fix notation as in the discussion of ¢-adic and 
mod £ representations in section 3.1. 


Definition 3.12 We say that a representation p : Gg —> GLa(k) is modular 
(of level N) if for some newform f of weight 2 (and level N), p is equivalent 
over ky to py. 


By proposition 1.32 the notion is independent of the choices in section 3.1 
of embeddings K — Q,, Q — Qı and Q — C. Moreover if K’ is a finite 
extension of K with residue field k’, then p is modular if and only if p 8x k’ is 
modular. 

The following was conjectured by Serre [Se7], (3.2.3). (See also [Dal] for 
further discussion and references.) 


Conjecture 3.13 Let p: Gg —> GL2(k) be a continuous absolutely irreducible 
representation with det(p(c)) = —1. Then p is modular. 


Some cases of Serre’s conjecture can be deduced from theorem 3.9. 
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Theorem 3.14 Let p : Gg — GL2(k) be a continuous absolutely irreducible 
representation with det(p(c)) = —1. Suppose that one of the following holds: 


(a) k is F3; 


(b) the projective image of p is dihedral. 


Then p is modular. 


Sketch of proof: For case (a), we consider the surjection 
GIAN AC) 


defined by reduction mod (1 + /—2). One checks that there is a section 
s : GLo(F3) — GLə(Z|v—2]) and applies theorem 3.9 to s o p. The resulting 
representation arises from a weight one newform, and hence its reduction p is 
equivalent to pr for some f (see remark 3.6). 


In case (b), Ø is equivalent to a representation of the form Ind ae where 
F is a quadratic extension of Q and £ is a character Gp — k*. (We have 
here enlarged K if necessary.) Let n be the order of €, choose an embedding 
Q(e?"/") — K and lift € to a character € : Gp — Zle?"/"|*. We may 
always choose € so that the Artin representation p = Ind ae is odd, i.e., 
det(p(c)) = —1. (In the case @ = 2 and F real quadratic, we may have to 
multiply € by a suitable quadratic character of Gp.) We then apply theorem 3.9 
to p and deduce as in case (a) that p is modular. 

Serre also proposed a refinement ([Se7], (3.2.4)) of the conjecture which 
predicts that p is associated to a newform of specified weight, level and char- 
acter. Through work of Mazur, Ribet [R5], Carayol [Ca2], Gross [Gro] and 
others, this refinement is now known to be equivalent to conjecture 3.13 if £ is 
odd. (One also needs to impose a mild restriction in the case £ = 3.) See [R6] 
and [Dil] for statements of the results and further references; here we give a 
variant which applies to newforms of weight two. Before doing so, we assume 
is odd and define an integer 6(p) as follows: 


e 6(p) = 0 if plc, is good; 
e 6(p) =1 if pla, is not good and |r, Qx k is of the form 
4 € * 0 
(oi) (Ge) o (ye) 


for some positive integer a < £. (Recall that € is the cyclotomic character 
and ~w is the character of I; defined after lemma 2.22.) 
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e (p) = 2 otherwise. 


Theorem 3.15 Suppose that £ is odd and p is absolutely irreducible and mod- 
ular. If € = 3, then suppose also that Plena is absolutely irreducible. Then 
there exists a newform f of weight two such that 


e p is equivalent over ky to pf; 
o Ny = N(p) 0), 
e the order of wy is not divisible by £. 


Proof: The existence of such an f follows from [Dil] thm. 1.1, thm. 5.1 and 
lemma 2.1, but with Ny dividing N(p)°™. By lemma 2.7 above, we see that 
Ny is divisible by N(p). The divisibility of Ny by 6(f) follows from results in 
sec. 8 of [Gro] and sec. 2.4 of [Edi]. o 


l-adic representations: We again use the notation of section 3.1. Let p : 
Go — GL2(K) be an f-adic representation. 


Definition 3.16 We say that p is modular if for some weight 2 newform f, p 
is equivalent over K; to py. 


The notion is independent of the choices of embeddings and well-behaved under 
extension of scalars by proposition 1.32 (cf. definition 3.12). 
The following is a special case of a conjecture of Fontaine and Mazur [FM]. 


Conjecture 3.17 If p : Gg — GL2(K) is an absolutely irreducible ¢-adic 
representation and p|q, is semistable (in the sense of section 2.4), then p is 
modular. 


(Recall that for us ¢-adic representations are defined to be unramified at all but 
finitely many primes. Recall also that if p|c, is semistable, then by definition 
det p|z, is the cyclotomic character e.) 


Remark 3.18 Relatively little was known about this conjecture before Wiles’ 
work [W3]. Wiles proves that under suitable hypotheses, the modularity of p 
implies that of p. 


Remark 3.19 The conjecture stated in [FM] is stronger than the one here; 
in particular, the semistability hypothesis could be replaced with a suitable 
notion of potential semistability. On the other hand, one expects that if plq, is 
semistable, then it is equivalent to pz (over K4) for some f on Ty (N(p))NT (4) 
(and on ['\(N(p)) if pla, is good). 


91 


The Shimura-Taniyama conjecture: Conjecture 1.54 can be viewed in the 
framework of the problem of associating modular forms to Galois representa- 
tions. Let FE be an elliptic curve defined over Q. For each prime £, we let pge 
denote the ¢-adic representation Go — GL2(Q,) defined by the action of Go 
on the Tate module of E (see section 2.2). 


Proposition 3.20 The following are equivalent: 


(a) E is modular. 
(b) pre is modular for all primes £. 
(c) pry is modular for some prime £. 


Proof: If E is modular, then E is isogenous to A+ for some weight two newform 
f with Ky = Q (see section 1.8). It follows that for each prime @, pge is 
equivalent to the ¢-adic representation py. Hence (a) > (b) = (c). 

To show (c) = (b), suppose that for some ¢ and some f, the representations 
Pre and py are equivalent. First observe that for all but finitely primes p, we 
have 

tr (p;(Frob,)) = tr (pn¢(Ftob »)). 


We deduce from proposition 2.11 and theorem 3.1, part (a) that for all but 
finitely many primes p 


alf) =pt+1— # Epl Fp) € Z. (3.2.1) 


Applying proposition 2.6, we find that for each prime £, pg, is equivalent to 
pr and is therefore modular. 

We finally show that (b) = (a). The equality (3.2.1) holds for all primes 
p not dividing Ny, which by theorem 3.1, part (d), is the conductor of E. 
Since det(p;) = det(pze) = €, we see by 3.1, Part (b) that Yẹ is trivial. By 
theorem 1.27 parts (b) and (d) (or [AL] thm. 3), a, is in {0,+1} for primes 
p dividing Ny. Thus Ky = Q and Aş is an elliptic curve. Faltings’ isogeny 
theorem (see [CS], sec. II.5) now tells us that E and A, are isogenous and we 
conclude that E is modular. o 


Remark 3.21 Note that the equivalence (b) (c) does not require Faltings’ 
isogeny theorem. 


Proposition 3.22 If the Fontaine-Mazur conjecture (conjecture 3.17) holds 
for some prime £, then the Shimura-Taniyama conjecture (conjecture 1.54) 
holds. If Serre’s conjecture (conjecture 3.13) holds for infinitely many £, then 
conjecture 1.54 holds. 


92 


Proof. The first assertion is immediate from proposition 3.20 and the irre- 
ducibility of pge. See [Se7], sec. 4.6 for a proof of the second. (We have 
implicitly chosen the field K to be Q, in the statements of conjectures 3.17 
and 3.13, but it may be replaced by a finite extension.) 


Remark 3.23 Note that to prove a given elliptic curve Æ is modular, it suf- 
fices to prove that conjecture 3.17 holds for a single £ at which EF’ has semistable 
reduction. Wiles’ approach is to show that certain cases of conjecture 3.13 im- 
ply cases of conjecture 3.17 and hence cases of conjecture 1.54. 


3.3 Hecke algebras 


In this section fix the following notation. Let Z be an odd prime, let K be a 
finite extension of Q,, let © denote the ring of integers of K, let A denote its 
maximal ideal and k its residue field. Fix embeddings K — Q;, Q — Q, and 
Q — C. Let J: Gg — GL2(k) denote a continuous representation with the 
following properties 


a s irreducible, 


( 
(b 


is modular, 


( 
(d) pla, is semi-stable, 


) P 
) Pi 
c) det 
) 
) 


(e) and if p # £ then #p(1p) i£. 


Let us first record the following lemma. 


Lemma 3.24 The representation pja, is absolutely irreducible where L = 


AV (-YEVPL). 


Proof: If it were not then we see that l /#p(Go) and so J is unramified at 


all p # £. Moreover we can check that plz, ~ at . If l > 3 we can use 


theorem 3.15 to deduce that p is modular of weight 2 and level 1 and hence 
obtain a contradiction. If 2 = 3 we see that the splitting field of p is everywhere 
unramified over Q(\/—3) and hence must equal Q(./—3), a contradiction. O 

Let X denote a finite set of primes. For the application to modularity of 
semistable elliptic curves, it suffices to consider sets X contained in X5 where 
Xz is defined as follows 
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Definition 3.25 For a representation p as above, we let X5 denote the set of 
primes p satisfying 


e p= and Pla, is good and ordinary; or 
e p + Land Jis unramified at p. 


We shall sometimes assume that X C X> in order to simplify statements and 
proofs. Let Ns denote the set of newforms f such that 


ps: Go > GL2(O}) 


is equivalent to a lifting of p ®, ky of type X and 0? N+ (the last condition is 
presumably not necessary, cf. remark 3.19). From theorem 3.1 and lemma 2.7 
one deduces the following description of Ns. 


Lemma 3.26 The set Ns consists of newforms f such that 
© pr = p@kky, 
e wy is trivial, 


e N; divides ON(p) J] p%™?’”, where 5 =0 if p is good and l g X and 
pEex—{e} 
ô = 1 otherwise. 


7 As p is modular it follows from theorem 3.15 that for all £, Vs 4 Ø. Set 
Ts = [pem O If p is a prime not in X and not dividing ¿N (p), we let Tp 
denote the element (a,(f))f in Ty. Then define Ty to be the O-subalgebra of 
Ty generated by the elements T, for such primes p. Then Ty is a complete 
noetherian local O-algebra with residue field k. Moreover it is reduced and it 


is a finitely generated free O-module. 


Lemma 3.27 There is a continuous representation 
peo" : Gg > GL2(Ts) 


such that if pJN(p) and p Z E then pee? is unramified at p and we have 
tr p¥°4 (Frob p) = T,. Moreover we have the following. 
(a) p&°* is a lift of p of type © and there is a unique surjection dy : Ry —> Ts 
such that pt ~ dy o pe’, 
(b) If =! D E then there is a unique surjection Ty — Ty such that p¥29 
pushes forward to p#°* and T, maps to T, for p{€N(p) and p g ¥. 
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(c) If K' is a finite extension of K and TS, is constructed in the same way 
as Ty but with K' replacing K then TS, = Ty 80 Ox. 


Proof. Consider p#°! = J] pp : Gg —> GL2(Ty). Choose some complex con- 


jugation, c, and conjugate p so that p(c) = (ò 1): By conjugating pad 


we may assume that modulo every maximal ideal it reduces to p and that 
parte) = (à a) Call the corresponding basis {e,,e_}. Note that by con- 


~mod ; 


tinuity and the Chebotarev density theorem tr p$°° is valued in Ty. Thus for 
any g € Go the diagonal entries of p#°*(g) lie in Ty, because tr P#4(g) and 
tr 8&4 (cg) both do. 


By irreducibility of ð we can find some o € Gg such that p(o) = E À 


where b Æ 0. Rescaling e, we may assume that p2°4(c) = - 7) Then for all 


g € Go the lower left entry of #2°4(g) lies in Ty (look at the upper left entry 
of p#e'(og)). Again using the irreducibility of p we can find T € Gg such that 
pet(r) = (€%) where e € TX. Looking at the lower right entry of p#°4(rg) 
we see that for any g € Gg the upper right entry of p@°"(g) lies in Ty. Thus 


pee is now in fact valued in GL2(Ts) and will be our candidate for p&°*. We 


leave the verification of the other properties of p&°* as an exercise. 


Example 3.28 Let p = /;,,,,3 where fs57g is the newform of level 57 discussed 
in the example of section 1.6. As fs57p is not congruent modulo 3 to any form of 
level 19 or 3 we see by theorem 3.15 that J is ramified at 19 and |c, is not good. 
On the other hand Jis semi-stable. The facts that p(Frob 2) has order 8 (see the 
table below) and 3|#/(J19) (from the discussion above) imply that J : Go — 
GL2(F3). The table below also shows that Tg & {(z,y) € Z : x = y mod 3}. 
There is a unique continuous representation pr : Gg — GLa(To) such that 
if p {57 then pp°? is unramified at p and tr pf°'(Frob,) = Ty. We have an 
isomorphism Ty/3Ty > F3[e] (where £? = 0) given by (x,y) — «+ 4*e. Thus 
we get a representation p : Gg —> GL2(Fs3[e]). The following table lists the 
traces of Frobenius elements for the first few unramified primes under these 
representations: 


p] 2 | 5 |7| 18 | 183 | 17 |2 | 29 |] 
I I 0 0 0 0 I =] 

py°"|(1, —2)|(—2, 1) (0, 3)|(0, —38)|(6, —6)(—6, 3)|(4, 4)|(2, —10) 

p ibs Ls. € —€ —€ 0 1 |—1—e 


Exercise 3.29 Show that there are three algebra homomorphisms Tọ — 
Z/9Z and hence show that there are at least three liftings of p of type 0 
to Z/9Z. 
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Exercise 3.30 What is the image of p? 


We will need two deeper properties of the Hecke algebras Ty (theorem 3.31 
and theorem 3.36 below). These results will be proved in the next chapter 
(sections 4.3 and 4.4). 


Theorem 3.31 Let Q be a finite set of primes as in section 2.8. Then Tg 
is a free O|Ag|-module, where Tg inherits the structure of an O|Ag|-module 
from Rg via the homomorphism ġo : Ra — Ta: 


Corollary 3.32 Ty = Tg/ag. 
Proof: From the definitions and corollary 2.45 we have that 
Ty 80 K = (To 80 K)/aQ, 


and from the theorem we have that Tg/ag is torsion free. The corollary 
follows. 

For the second of these results we will need some additional notation and 
we restrict our attention to sets of primes contained in X5. Suppose that 
X C X; and that f is an element of Ns whose Fourier coefficients are in O. 
Then O', = O and projection to the component corresponding to f gives rise 
to an O-algebra homomorphism 


t=T: Ts > O. 
The pushforward of p2°¢ by m is equivalent to 
pr: Go GLA(O). 


Remark 3.33 Most of the objects considered in the rest of the section will 
depend on the choice of newform f. We also remark that to give an O-algebra 
homomorphism Ts; — © is equivalent to giving a newform f in Ms with 
coefficients in ©. Indeed given such a homomorphism, there exists a newform 
f in Ms such that the homomorphism is defined by T, +> a,(f) for all p ¢ X 
with p not dividing 4N (p). The fact that ap( f) € O for such p implies (using 
for example parts (b) and (d) of theorem 1.27 and lemma 4.1 below) that all 
the Fourier coefficients of f are in O. The uniqueness of f is a consequence of 
the theory of newforms, theorem 1.22. 
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For X’ satisfying 
XC CE; 


let Ty; denote the composite Ty — Ts — O. Let 
nN = Ty (Ann Ty (ker Tyy)). (3.3.1) 


Note that because Ty, is reduced, ny 4 (0). Also let pyy denote the kernel of 
bx: o Tsx. Recall that 


Hox / Sy = #Hy(Q, adp; 80 K/O). 


Remark 3.34 We have not yet shown that these groups are finite, but if 
either is finite, then so is the other and their cardinality is the same. 


Note that if @ is in X; — © then Ty := (ae(g))y € Ty is actually in Ty. 
This follows from theorem 3.1(f), which shows that Ty = ay + ay! where ay 


is the eigenvalue of Frob p on the free rank one Ty-module M (0) where M is 


the module underlying pz? (see lemma 2.20). Now that we have shown that 


T; € Ts, we may characterize ay as the unit root in Ts of 
X?-T%|X +l=0. (3.3.2) 
For primes p in X5 — & we define an element cp € Ty by 
Cp = p= - (p + 1)”). 


Note that m(cp) = (p — 1)(a,(f)? — (p + 1)?), which is non-zero by theo- 
rem 1.27(a). A calculation using theorem 2.17 and theorem 3.1(a) shows that 
if p is in U5 — (X U £) then 


#H' (Gp, ad°ps 80 K/O)/H'(Gp/Ip, ad pp 80 K/O) 
= #H° (Gp, ad° ps 80 K/O(1)) = #O/n(G). 


If Z is in 4; — È then we see from proposition 2.27 and theorem 3.1(f) that 
#H} (Ge, ad° ps Q0 K/O)/H; (Ge, ad? ps Q0 K/O) = #O/n(cx). 
For the next proposition define groups Hp for p € X5 — X by 
e H, = H'(G>,ad° pz 80 K/O)/H"(G,/Ip, ad pz 80 K/O) if p # £ 
e and Hy = Hi (G, ad°p7 80 K/O)/H} (Ge, ad? pf 80 K/O). 


Then we have the following result. 
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Proposition 3.35 IE C X’ C X>, then 


#H} (Q, ad’ ps 80 K/O)/H}(Q, ad? ps Q0 K/O) < #(O/n( J| &)) 


pEex’—& 


Moreover if we have equality then the sequence 


(0) + H3(Q, ad°py 80 K/O) > H},(Q,ad°py 80 K/O) > GD H, > (0) 
5 


pEx!— 
as exact. 


Finally we state the second theorem on Hecke algebras which shall be 
proved in section 4.4. 


Theorem 3.36 If © C X! C X; and f is a newform in Ny with coefficients 
in O, then we have 


Ns Cm I] Cp) NS. 


pEed’—xd 


Corollary 3.37 With the above notation 


#Hy,(Q, ad° py 80 K/O)/H3(Q, ad°pf 80 K/O) < #(ns/ns). 


3.4 Isomorphism criteria 


The main thrust of Wiles’ approach is to prove that in many circumstances the 
map s : Ry — Ty is an isomorphism. For this we will need two criteria from 
commutative algebra. The first was found by Wiles [W3] (but is presented here 
in a slightly stronger form due to Lenstra [Len]); the second was developed by 
Faltings, from the original arguments of [TW]. In both criteria the notion of 
complete intersection plays a vital part. 

Proofs of all the results in this section, together with some background, 
references and examples, is given in chapter 5. 


Definition 3.38 Suppose that A is an object of Co which is finitely generated 
and free as an O-module. Then we call A a complete intersection if and only 
if for some r € Z>o and some f,,..., fr € OIX, ..., X,]] we have 


A S OX, a XG -o fr) 


(i.e. there are the same number of generators as relations). 
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We first record a lemma about complete intersections. 


Lemma 3.39 Suppose that K'/K is a finite extension with ring of integers 
O’ and that A is an object of Co which is finitely generated and free as an 
O-module. Then A is a complete intersection if and only if A 80 O' is. 


For the proof, see chapter 5, lemma 5.30. 

Now fix objects R and T of Co and a surjection of O-algebras ¢: R —> T. 
Also assume that T is a finitely generated, free O-module. The first criterion 
is as follows. 


Theorem 3.40 Suppose that 7: T —> O. Let p = ker (n o ¢) < R and let 
n = m7(Ann 7(ker7)) dO. Suppose also that n # (0). Then the following are 
equivalent. 


(a) The inequality #0/9? < #O/n is satisfied. 
(b) The equality #0/ 9? = #O/n is satisfied. 


(c) The rings R and T are complete intersections, and the map ọ : R — T 
is an isomorphism. 


The proof is explained in chapter 5, sections 5.1 to 5.8. (See theorem 5.3.) 

For the second criterion let us also fix a non-negative integer r. If J < 
O|[Si,...,S,]] is an ideal contained in (Sj,...,5,), then by a J-structure we 
mean a commutative diagram in Co 


Oa RS See! 


such that 
(a) T/S,- Sr) >T and R'/(Si,...,S,) —> R, 
(b) for each ideal I D J, I = ker (O[||S1, ..., S,]] > T'/D). 


Theorem 3.41 Suppose there exist a sequence of ideals Jn < O[[Si,..., S+]] 
such that Jo = (54,...,Sr), Jn D Inti, fìn Jn = (0) and for each n there exists 
a J,-structure. Then the map R — T is an isomorphism and these rings are 
complete intersections. 


The proof of this statement is explained in sections 5.9 and 5.10. 
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3.5 The main theorem 


We are now in a position to deduce the main theorems. We will keep the 
notation and assumptions from the start of section 3.3. 


Theorem 3.42 Keep the notation and assumptions of section 8.8. Then, for 
all finite sets & C Up, dy : Ry — Ty is an isomorphism and these rings are 
complete intersections. 


Remark 3.43 There seems to be a deep link between the fact that dy is an 
isomorphism and the fact that Ty is a complete intersection. The proof of the 
theorem divides into two parts. One first proves it in the minimal case where 
X = @. One then deduces the full theorem from this special case by a different 
argument. In both these steps the facts that @y is an isomorphism and that 
Ty is a complete intersection are proved simultaneously. 


Proof of theorem 3.42: Note that to prove the theorem we may extend scalars 
if necessary (by lemma 3.39) and hence assume that both of the following hold: 


e The eigenvalues of all elements of the image of p are rational over k. 


e There is a newform f in Ng with coefficients in O, hence an O-algebra 
homomorphism Ty > O. 


We first prove that @g : Rg —> Tg is an isomorphism and that the rings 
Rg and Tg are complete intersections. 

Note that according to theorem 2.49 and lemma 3.24 we can find an integer 
r > 0 and for each n € Zso we can find a set Qn of r primes such that 


o if q E Qn then q = 1 mod £”; 
e if q E Qn then J is unramified at q and p(Frob ,) has distinct eigenvalues; 
e Ro, can be topologically generated by r elements as a O-algebra. 


Let J) = ((% + 1)™ — 1: i= 1,...,r) < O[[S),... , S,]], where the n; are 
chosen such that O[[Si,...,5;]]/Jn = O[Ag,]. Then for each n we have a 
diagram 


MG Srl] 
ojx, Saa ee || =R. Ro, —> To, 
l { 
Ro —> To, 


where 
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(a) Ro, /(Si,--. , S) => Rg (see corollary 2.45), 


(b) To, is finite and free over O[[S),... ,S;]]/J}, (see theorem 3.31), 


(c) Tg, /(S1,--- ,S;) > Tg (see corollary 3.32). 


Let J, = ((S; +1)" —1:i=1,...,r). Replacing Tg, and Rg, by Tg, /Jn 
and Rg,,/Jn we see that we have a J,-structure for every n. Theorem 3.42 for 
£ = Ú now follows from the criterion of theorem 3.41. 

We now turn to the proof of theorem 3.42 in the general case. By theo- 
rem 2.41 and theorem 3.40 we see that 


# H (Q, ad°ps 80 K/O) = #O/no, (3.5.1) 
and so applying corollary 3.37 we see that for any © C Np 
#H3(Q, ad°ps 80 K/O) < #O/ns. (3.5.2) 


A second application of theorems 2.41 and 3.40 allows us to deduce theo- 
rem 3.42. 


Remark 3.44 In certain cases where ng = (1), the bound (3.5.1) on the order 
of the Selmer group Hj(Q,ad°p;@o K/O) also follows from the previous work 
of Flach, by a different method. See [F11] for details. 


Corollary 3.45 Keep the notation of theorem 3.42 and suppose that f is a 
newform in Ny with coefficients in O. 


(a) We have 
#H3(Q, ad" pp 80 K/O) = #O/n» < 00, 


where ns was defined in equation (3.3.1) after remark 3.88. 
O) FECE Cy; then 


(0) ar Ay(Q, ad” ps ®o K/0) at A, (Q, ad’ ps 8o k/O) 
= Dex» p — (0) 


is exact, where the groups H, and H; were defined before proposition 3.35. 


Proof: The first part now follows as a direct consequence of theorem 3.42 and 
another application of theorems 2.41 and 3.40. The second part follows from 
the first, together with proposition 3.35 and theorem 3.36. 
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Corollary 3.46 Suppose p : Gg —> GL(K) is a continuous representation 
and let p denote its reduction. Suppose also that 


(a) p is irreducible and modular, 


(b) if # l then pln ~ (OT), 
(c) pla, is semi-stable, 
(d) det p =e. 

Then p is modular. 


Proof. We let & denote the set of primes in X5 at which p is ramified. Then 
p : Go — GL2(Q) is a deformation of p of type X, so there is an O-algebra 
homomorphism Ry —> O such that p = p% @p,, O. Since dy : Ry — Ty is an 
isomorphism by theorem 3.42, it follows that there is a homomorphism Ty — 
O sending T, to tr (p(Frob,)). Since such a homomorphism is necessarily of 
the form Tp +> ap( f) for some newform f, it follows that p is equivalent to pr 
and hence is modular. 


Corollary 3.47 Suppose that E/Q is a semi-stable elliptic curve such that 
Pr is irreducible. Then E is modular. 


Proof. One need only apply the last corollary with Z = 3 and theorem 3.14. 


3.6 Applications 
The Shimura-Taniyama conjecture for semi-stable elliptic curves: 
Theorem 3.48 If E/Q is a semistable elliptic curve, then E is modular. 


Proof: By corollary 3.47, it is enough to show that EF’ is modular when its 
associated mod 3 representation Jg is reducible, i.e., when E has a subgroup 
of order 3 defined over Q. Consider the group E[5] of 5-division points of F. 
The mod 5 Galois representation pz, associated to E|5] is irreducible: for 
otherwise, Æ would have a subgroup of order 15 defined over Q, and would 
give rise to a (non-cuspidal) rational point on the modular curve Xo(15). This 
curve is of genus one, and is known to have only 4 non-cuspidal rational points, 
which do not correspond to semi-stable elliptic curves (and, at any rate, are 
known to correspond to modular elliptic curves). Hence we know that fz5 
satisfies all the assumptions of corollary 3.46, except the (crucial!) modularity 
property. To show that g is modular, one starts with 
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Lemma 3.49 There is an auxiliary (semi-stable) elliptic curve AJQ which 
satisfies 


(a) A[5] ~ E[5] as Ge-modules. 
(b) A[3] is an irreducible Go-module. 


Proof: Let Y'(5) be the curve over Q which classifies elliptic curves A together 
with an isomorphism E[5] ~ A[5| compatible with Weil pairings. Elliptic 
curves over Q satisfying (a) correspond to rational points in Y’(5)(Q). Ad- 
joining a finite set of points to Y'(5) yields its compactification X’(5) which 
is a twist of the modular curve X(5) with full level 5 structure. (L.e., it be- 
comes isomorphic to this curve, over Q.) As was shown by Klein, the modular 
curve X(5) over C has genus 0. Since X'(5) has a point xo defined over Q 
corresponding to F, it is isomorphic over Q to Pt. The rational points of Y’(5) 
therefore give a plentiful supply of elliptic curves satisfying condition (a). Now 
consider the curve Y’(5,3) classifying elliptic curves A with an isomorphism 
E|5| ~ A[5] (respecting Weil pairings) and a subgroup of A of order 3. One 
checks that the compactification of Y'(5,3) has genus greater than 1, hence 
has only finitely many rational points by Faltings’ theorem (the Mordell con- 
jecture). It follows that only finitely many points Y’(5)(Q) are in the image of 
Y'(5,3)(Q) under the natural map Y'(5,3) — Y’(5). Hence for all but finitely 
many points x in Y’(5)(Q), the corresponding elliptic curve A satisfies (b) 
since it has no rational subgroup of order 3. Choosing x arbitrarily close in 
the 5-adic topology to £o, we find that the elliptic curve A associated to x is 
semistable and satisfies the two conditions in the lemma. 

We can now finish the proof of theorem 3.48. Applying corollary 3.47 to 
the curve A, we find that A is modular. Hence so is the mod 5 representation 
PAs œ Pes. Now applying corollary 3.46 with ¢ = 5 and p the representation 
of Gg acting on the 5-adic Tate module of E, we find that pg 5 is modular, 
and hence, so is Æ, as was to be shown. 


Remark 3.50 Wiles’ original argument uses Hilbert’s irreducibility theorem 
where we have used Faltings’ theorem. The alternative presented here is based 
on a remark of Karl Rubin. 


Remark 3.51 The results of [W3] and [TW] actually apply to a larger class 
of elliptic curves than those which are semistable. In [Di2], their methods are 
further strengthened to prove that all elliptic curves which have semi-stable 
reduction at 3 and 5 are modular. 
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Remark 3.52 Rubin and Silverberg observed that an elliptic curve of the 
form y? = x(x — a)(x + b) has a twist with semi-stable reduction at all odd 
primes, hence is modular by [Di2]. In fact it is shown in [DK] that their 
observation together with the general results of [W3] and [TW] already imply 
modularity. 


Fermat’s Last Theorem: As was already mentioned in the introduction, 
the Shimura-Taniyama conjecture for semi-stable elliptic curves (and, more 
precisely, for the elliptic curves that arise in Frey’s construction explained in 
section 2.2) implies Fermat’s Last Theorem. 

More precisely, suppose that there is a non-trivial solution to the Fermat 
equation zf +yf = zf, with £ > 3. By theorem 2.15 the Frey curve constructed 
from this solution (cf. section 2.2) is a semistable elliptic curve E/Q whose 
associated mod £ representation fz is irreducible, unramified outside 2¢ and is 
good at £. Serre’s conjecture predicts that Jg arises from a newform of weight 
2 and level 2; the “lowering the level” result of Ribet [R5] (cf. theorem 3.15) 
actually proves this, once we know that E is modular, i.e., Jg arises from a 
modular form of weight 2 and some level. But this is a contradiction, since 
there are no modular forms of weight two and level two: such forms would 
correspond to holomorphic differentials on the modular curve X9(2) which is 
of genus 0. This contradiction completes the proof. 


Values of [-functions: Also mentioned in the introduction was the relation- 
ship between the calculation of the Selmer group (3.5.1) and certain cases of 
a conjecture of Bloch-Kato [BK], called the Tamagawa number conjecture. It 
was in this context that partial results were obtained by Flach in [F11] (cf. 
remark 3.44). 

If f is a newform of weight 2, then one can associate to f a certain “sym- 
metric square” L-function L(Symm °f, s). We shall recall the definition in sec- 
tion 4.4 and explain how a method of Hida establishes a relationship between 
L(Symm?f,2) and O/ny in the setting of corollary 3.45. We may therefore 
regard part (a) of that corollary as a relationship between L(Symm °f, 2) and 
the size of a Selmer group. While the result is in the spirit of the Tamagawa 
number conjecture of [BK], we have not verified that the relevant cases of the 
conjecture can be deduced from it. We shall however state a partial result 
in the context of semistable elliptic curves. The reader can consult [F11] and 
[F12] for a discussion of the relation to the Tamagawa number conjecture. 
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Suppose that E is a semistable elliptic curve over Q of conductor Ng and 
(minimal) discriminant 
Ap = I] p”. 


pP\Ne 
The symmetric square L-function associated to E is defined by 


L(Symm?E, s) = I] L (Symm? E, s) 


P 
where the Euler factors L,(Symm ?E,s) are defined as follows: 


e If p{ Nz, then 


L,(Symm*E, s) = (1 — agp™*)(1 — p'*)(1 — Bpp*)) 


where a, and (3, are the roots roots of X? —apX +p with ap = p+1—N, 
as in section 1.1). 


e If p|Ng, then L,(Symm?E, s) = (1 —p7*)7!. 


Let 
Qe = f oe A fee 
E(C) 


where wy" is the Néron differential defined in section 1.1. Since E is modular 
by theorem 3.48, a method of Shimura (see [Shi4] and the introduction of [St]) 
establishes the analytic continuation of L(Symm °F, s) to an entire function 
and shows that L(Symm7?F,2) is a non-zero rational multiple of itQg. We 
now explain how to deduce the following theorem from Wiles’ results and a 
formula of Hida, corollary 4.21. 


Theorem 3.53 Suppose that E is a semistable elliptic curve and £ is a prime 
such that 


è Pry is irreducible, and 


e ¢ does not divide 2 [| dp. 
p|Ng 
Then the €-part of 
NgL(Symm? E, 2) 
TIO g 


is the order of 
Hy (Q, ad’ pp p Sze Qe/Zz). 
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Sketch of proof: Since E is semistable, it is modular by theorem 3.48. Letting 
f denote the associated newform, we have N; = Np and L(Symm PT s) = 
L(Symm? f, s). Furthermore the representations pp and 


ps: Go > GL2(Ze) 


are equivalent, since they are equivalent over Q; and J = pge is irreducible. 
The conditions on F and £ ensure that p and f satisfy the hypotheses of 
corollary 3.45 with © = Ø, so that 


#Hġ (Q, ad’ pee Dz, Qe/Ze) = #(Ze/no). 


To apply Hida’s formula, corollary 4.21, it remains to relate det A and Qg 
where the matrix A € GL»2(C) is defined in section 4.4. Using that Æ is 
semistable, £ is odd and Jg is irreducible, one obtains a modular parametriza- 
tion 7: Xo(Ng) > E such that 


e 7*: H'(E,Z:) > H'(X0(Ne), Ze) has torsion-free cokernel; 
e the Manin constant for m is not divisible by £ ([Maz2], sec. 4(a)). 


One can then verify that Q;' det A is an ¢-adic unit and theorem 3.53 follows 
from corollary 4.21. 


4 Hecke algebras 


4.1 Full Hecke algebras 


Suppose that K is a finite extension of Q; for some prime ¢. Let © denote 
its ring of integers and let k = O/X where A is the maximal ideal of ©. Fix 
embeddings K — Q;, Q — Q, and Q => C. Recall that we defined “Hecke 
algebras” over O in two different contexts: 


e In section 1.6 as an algebra To generated by the full set of Hecke oper- 
ators acting on a space of modular forms; 


e In section 3.3 as a certain subring Ts of a product of fields of Fourier 
coefficients of newforms giving rise to the same mod £ representation. 


The first of these provides a concrete geometric description useful for establish- 
ing properties of the fine structure of the algebra; the second yields a reduced 
ring which is more easily interpreted as the coefficient ring of a Galois repre- 
sentation. In the next section we shall relate the two notions by identifying 
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the “reduced Hecke algebras” of the form Ty as localizations of the “full Hecke 
algebras” of the form To. Before doing so we need to recall some fundamental 
properties of the algebras Tx, To and T,. 

Let IT = [y(N) for some positive integer N and subgroup H of (Z/NZ)* 
(see section 1.2). We let Tz denote the subring of End (S2(T)) generated by 
the operators T, for all positive integers n and (d) for all d € (Z/NZ)*. If R 
is a ring, then Tp denotes the R-algebra Tz ® R. Recall that Tp acts faithfully 
on S(T, R) and is finitely generated and free as an R-module. (This holds for 
R = Z, hence for arbitrary R.) 

We first record the following lemma: 


Lemma 4.1 (a) Tr is generated as an R-algebra by either of the following 
sets of elements: 


e T, for all positive integers n. 
e T, for all primes p and (d) for all d in (Z/NZ)*. 

(b) Suppose that D is a positive integer relatively prime to N. If either D 
is odd or 2 is invertible in R, then Tr is generated as an R-algebra by 
either of the following sets of elements: 

e T, for all positive integers n relatively prime to D. 

e T, for all primes p not dividing D, and (d) for all d in (Z/NZ)*. 
For a proof of (a), see [DI], prop. 3.5.1; for (b), see p. 491 of [W3]. 
The spectrum of To: First note that Tx and Tą are Artinian, hence have 
only a finite number of prime ideals, all of which are maximal. Since To is 
finitely generated and free as an O-module, its maximal (resp. minimal) prime 
ideals are those lying over the prime À (resp. (0)) of ©. (This follows from the 


going-up and going-down theorems, [Mat] thms. 9.4 and 9.5, for example.) It 
follows that the natural maps 


To — To 80o K = Tx; and To > To 80o k S Tk 


induce bijections 


{ maximal ideals of Tx } + {minimal primes of To } and 
{ maximal ideals of T } < { maximal primes of To }. 


Moreover, since O is complete we have (by [Mat] thms. 8.7 and 8.15, for 
example) that the natural map 


To > | [Tm 
m 
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is an isomorphism, where the product is over the finite set of maximal ideals m 
of To and Tm denotes the localization of To at m. Furthermore each Tm is a 
complete local O-algebra which is finitely generated and free as an O-module, 
and each minimal prime P of To is contained in a unique m. 

Now suppose that f = Y` anq” is a normalized eigenform in S(T, K) for 
the operators T, for all n > 1. Then T, + a, defines a map Tz —> K and 
induces a K-algebra homomorphism 0; : Tx — K. The image is the finite 
extension of K generated by the an, and the kernel is a maximal ideal of Tx 
which depends only on the Gx-conjugacy class of f. Similarly a G,-conjugacy 
class of normalized eigenforms in S2(T, k) gives rise to a maximal ideal of T. 

Recall also that a normalized eigenform f in S2(T, K) has coefficients in 
Ox, hence gives rise by reduction to a normalized eigenform f in S(T, k). 
Furthermore if f and g are Gx-conjugate, then f and g are G,-conjugate. 

We have thus constructed a diagram of maps of finite sets whose commu- 


tativity is easily verified. 


{ normalized eigenforms in > { normalized eigenforms i in \ 
S(T, K) modulo Gx-conjugacy S(T, k) modulo G;-conjugacy 


{ maximal ideals of Tx } { maximal ideals of T% } (4.1.1) 


{ minimal primes of To } — { maximal primes of To }. 


Proposition 4.2 The vertical maps are bijective, and the horizontal maps are 
surjective. 


Proof. For the injectivity of the upper-left vertical arrow, note that if p is 
a maximal ideal of Tx, then all K-algebra homomorphisms Tx/p << K are 
obtained from a single one by composing with an element of Gx. For the 
surjectivity, let K’ = Tx /p and p’ denote the kernel of the natural K’-algebra 
homomorphism Tx —> K’. Since Tx acts faithfully on S(T, A’), the local- 
ization S2(T, Kk"), is non-zero, hence so is S(T, K")[p’]. (For an R-module M 
and an ideal I of R, we write M[I] for the intersection over the elements r in T 
of the kernels of r : M — M.) It follows that there is a normalized eigenform 
f in S(T, K’) so that p’ is the kernel of the 0% : Tx — K’, and therefore p is 
the kernel of 67. To prove that the upper-right vertical arrow is bijective, note 
that the above arguments carry over with K replaced by k. 


Remark 4.3 The surjectivity of the top arrow is called the Deligne-Serre 
lifting lemma ([DS], lemma 6.11). 
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Suppose that m is a maximal ideal of To. Note that the maximal ideals 
of Tx mapping to m are precisely those p for which pM To is contained in m. 
Note also that the natural map 


Om: Ta 80 K > | [T, 
p 


is an isomorphism, where the product is over such p. 

It is straightforward to check that the above constructions are well-behaved 
with respect to replacing the field K by an extension K’. More precisely, 
for each set S in the above diagram, there is a natural surjective map from 
w : S' — S where S’ is defined by replacing K with K’, and these maps are 
compatible with the maps in the diagram. Furthermore the maps 


T,@xK'> [|| Ty ad Tr@00> || Tw 
p'ew!(p) mew !(m) 
are isomorphisms by which am ®x K’ can be identified with J| aw. 
mew! (m) 
Associated Galois representations: Suppose that p is a maximal ideal of 
Tx and m is the associated maximal ideal of To. By lemma 1.39, 


Te(Ju(N)) 8z, K 


is free of rank two over Tx, so reduction mod p yields a two-dimensional vector 
space over the field Tx /p endowed with an action of Gg. The resulting Galois 
representation 


Pp : Go > GLa(Tx/p) 


is unramified at all primes p not dividing N£, and for such p the characteristic 
polynomial of pp(Frob p) is 


X?’ -T,X +p(p) mod p. 


If Z is odd, then fy is defined over To/m and we write pm for its semisimplifi- 
cation 
Pm: Go Eri GL2(To/m). 


Thus pm is unramified at primes p not dividing N£ and the characteristic 
polynomial of Frob , is 


X?—T,X +p(p) mod m. 
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Suppose that g is in the Gx-conjugacy class of eigenforms in S(T) corre- 
sponding to p. If g is a newform then Tx /p is isomorphic to the field denoted 
K; in section 3.1 and pp» can be identified with the representation 


Py: Gq > GLo( Ki) 


considered in theorem 3.1. If also ¢ is odd then py, is obtained from pm by 
extending scalars. More generally suppose that g is not necessarily a newform 
and consider the associated newform f. Let D denote the product of the primes 
which divide N but not Ny. Let TP denote the K-subalgebra of Tx generated 
by the operators T, for n relatively prime to D and (d) for d in (Z/NZ)*. Let 
I’ =T (Nș) where H’ is the image of H in (Z/N;Z)*, and let T, = T; 8 K 
where T; is the Hecke algebra pone on S2(I”). Restriction of operators defines 


a natural homomorphism Te — Tx which is surjective by lemma 4.1. The 
composite with Ti, —> K; factors through the field Te? /(pn TO), so we may 
identify K; with a subfield of Tx/p and p, is then equivalent to the extension 
of scalars of pp. If £ is odd, then pm and py are defined and equivalent over a 
common subfield of To/m and kr. 


The structure of Tx: We now give an explicit description of Tx in the case 
that K contains the coefficients of all eigenforms of level dividing N. Let Nr 
denote the set of newforms in S(T), i.e., the set of newforms f of level Ny 
dividing N such that H is contained in the kernel of the character 


(Z/NZ)* > (Z/NyZ)* Ž K™. 


By theorem 1.22 
S(T, K) = @ Sx, 


FENT 
where Sx, is spanned by the linear independent elements 


{ f (aT) |a divides N/Nș }. 
For each f = S*a,(f)q” in Mr let 
e Tx, denote the image of Tx in End gSx,f; 


e Ax s denote the polynomial ring over K in the variables u fp indexed by 
the prime divisors of N/Ny; 


e Ig ¢ denote the ideal in Ag, generated by the polynomials 


Up(N/N¢)-1 
Prplttyp) = upp "(ty — an fen + VPP): 
for primes p dividing N/N; (setting Ys(p) = 0 if p divides Nf). 
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Consider the K-algebra homomorphism Ax, —> Tx, defined by mapping u fp 
to the operator Tp. Since Py, is the characteristic polynomial of T, on the span 
of { f(ap'r) |i =1,... , vp(N/N) } for each a dividing N/(N pp’? /*2)), we see 
that Ix, is contained in the kernel of Ax,¢ —> Tx,r. Taking the product over 
f in Mr, we have a surjective K-algebra homomorphism 


[] 4«5/t«.1 > | [Trs 
f f 


Since the natural map Tg —> [[ Tx, is injective and 
f 


dim Tr a dimc S(T, C) = S © o0(N/Ny) = S dimy Ax ¢/Ikf; 
f f 


we conclude 


Lemma 4.4 There is an isomorphism of K -algebras: 


Qa: TK =? I] Ax, t/Ik,t 
FENT 


defined by 
e a(Tp)f =@(f) if p is a prime not dividing N/ Nf; 
e a(Tp)f = Ur, mod Ix. ¢ ifp is a prime dividing N/N;; 


e a((d)), = ve(d) ifd is relatively prime to N. 


Remark 4.5 It follows that the algebra Tx is a “complete intersection” over 
K in the sense that it is a finite-dimensional K-algebra of the form 


rab rene ah E 2 


for some r. 


4.2 Reduced Hecke algebras 


As in section 4.1, we suppose K is a finite extension of Q; with ring of integers 
O and residue field k, and fix embeddings K — Q;, Q — Q and Q — C. We 
assume in this section that £ is odd and we fix a representation 


pP : Go =z GL2(k) 
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which is modular (definition 3.12). Thus J is equivalent to pm (over To/m) for 
some I =T (N) and maximal ideal m of To. 

We suppose also that p has the properties listed at the beginning of sec- 
tion 3.3 and that X is a finite set of primes contained in X; (definition 3.25). 
We shall show that [ and m can be chosen so that the reduced Hecke alge- 
bra Ty can be identified with a localization Tm of the full Hecke algebra To. 
The main result is due to Wiles ([W3], prop. 2.15), but we also explain an 
important variant (|TW], lemma 1) which arises when considering special sets 
of primes Q as in section 2.8 above. 


Ty and the full Hecke algebra: Let IT = ['9(Ns) where 
Ns=€N(p) TT pom’ = [> I] 2 (4.2.1) 
pEex— {el} pIN(p) pEex—{é} 


with ô = 0 if J is good and ¢ ¢ X and ô = 1 otherwise. 

Suppose that f is a newform in Ny. Recall that Ms, defined in section 3.3, 
is the set of newforms f in S(T) such that J is equivalent to py over ky 
(lemma 3.26). Note that these representations are equivalent if and only if 


apl f) mod X = tr(p(Frob,)) for all p{ Ns, 


where A’ is maximal ideal of the ring of integers of K Z There is then a 
normalized eigenform g in S2(T, K+) characterized by 


e a,(g) = ap( f) if p does not divide Ny,/Ny; 
e a (g) =0 if pF £ and p divides Ny/Ny; 
e a(g) is the unit root of X? — ap(f)X + £ if £ divides Ny/Ny. 


Note that if £ divides Ny/Ny, then ô = 1 and J is flat, hence J is ordinary and 
ap(f) is a unit by theorem 3.1 (£f). 


Lemma 4.6 For f and g as above, the reduction g is the normalized eigenform 
in S(T, k’) characterized by: 


© a,(g) = trp;,(Frob,) if p = L or p is not in X (recall that pr, denotes 
the representation on I,,-coinvariants); 


e a,(g) =0 otherwise. 


Furthermore, if K is as in lemma 4.4, then g is the unique normalized eigen- 
form in Sg, s with this reduction. 
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Proof: If p is not in X U {£4} then the formula for a,(g) = ap(f) follows from 
theorem 3.1, parts (a) and (e). If p = Z, then we use parts (f) and (g) of 
theorem 3.1. Otherwise, we have a,(g) = 0 by theorem 1.27(d) (if p?|Ny and 
f has trivial character, then a,(f) = 0). The uniqueness follows from the fact 
that if p|| Np, then a,(f) = +1 (by theorem 1.27(b)). m 
Note in particular that g has coefficients in k and is independent of the 
choice of f in Ny. We let m denote the corresponding maximal ideal of To. 


Proposition 4.7 There is an isomorphism Ts —> Tm of O-algebras such 
that Tp +> T, for all primes p not dividing Ny. 


Proof. Replacing K by a larger field K’, we have Ts 00! —> Th (lemma 3.27) 
and Tm Q0 O' —> T'y (since there is a unique maximal ideal m’ of To over 
m). We are therefore reduced to the case where K is as in lemma 4.4. Note 
that Ns is the set of newforms f in Np such that 


a,(f) = tr (p(Frob ,)) for all p{ Nyé. 


(We have K = K; and write f for f mod À.) 
We now define an isomorphism of K-algebras 


k:Tua8 K > |] K, 
fENS 


such that for each f € Ms: 
e «(T,)¢ = a,(f) if p is not in X; 
e «(T,)¢ = 0 if p is in X — {4}; 
e «(Ty),¢ is the unit root of X? — ap(f)X +p, (O€ if £ is in Ð. 


Recall that Tn 8 K = IT, Tp where p runs over the primes of T whose 
preimage in To is contained in m. Thus according to lemma 4.4 we have 


TaS K > [] [J Arike 
FENT peM ¢ 


where M, is the set of prime ideals in Ax,¢/Ix,f whose preimage in To is 
contained in m. If f is not in Vy, then My, is empty. If f is in Ms, then 
My, consists only of the kernel ps of the map defined by up,f œ ap(g) where 
g is the eigenform of lemma 4.6. Furthermore the maps up, > ap(g) induce 
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isomorphisms (Ax,¢/Ik,f)p, —> K, and taking the product over f € Ny, we 
obtain the desired isomorphism &. 

Identifying Ty with the O-subalgebra of [|] tenn K generated by the ele- 
ments T, = (ap(f))s for all p not dividing NyZ, it suffices to prove that Ty 
contains «(T;,) for all p dividing N(p)é. Now observe that for each f in Ms 
the representation py is isomorphic to 


Go =e GL2(Ts) ae GL2(K) 


mod 


obtained by composing ps°° with the projection Ty — K. It follows that for p 
dividing N(p)0°, the image of (p#°*);,(Frob,) in K is (p¢)1,(Frob ») = apl f); 
and therefore 

(08°") 1p (Frob ») = (Tp). 


Finally if ô = 0, then we conclude from lemma 4.1 that Ty contains «(T7). 


A twisted variant: Suppose now that © is a finite set of primes contained 
in X; — {/} such that if p is in X then p(Frob,) has eigenvalues a, and 6p in 
k satisfying 


ap/ Bp FP. (4.2.2) 
Note that (4.2.2) is equivalent to 
tr (p(Frob »))? 4 (p + 1%. (4.2.3) 


Let Q denote the set of primes in © such that q = 1 mod £. The set of primes Q 
is therefore as in section 2.8. Choosing an eigenvalue a, of p(Frob ,) for each 
q E€ Q as in section 2.8, we regard Rg and hence Tg as an O[Ag]-algebra. 
(Recall that Ag is the the maximal quotient of (Z/([] )Z)* of €power 
order.) 

Instead of working with ['9(Vs) as in proposition 4.7, we shall now work 
with the group 


qeQd 


T =To(No) OT1(M), (4.2.4) 


M= I] vita 


pEX-Q qEQ 


where 


Remark 4.8 We are about to relate Tg to a localization of To, where To 
is now defined using the Hecke operators on S(T), with F defined by (4.2.4). 
Recall that Tg is defined using modular forms with trivial character, but note 
that the modular forms involved in the definition of To may have non-trivial 
character. The purpose of establishing this relationship is to give a concrete 
realization of the image of Ag in Tg for the purpose of proving theorem 3.31. 
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Suppose that f is a newform pair in Ng. For each q in Q, let @,(f) be the 
root of X? —a,q(f)X +q = 0 (in K4) whose image in ky is ag. We let g denote 
the unique normalized eigenform in S(T’, Ay) of trivial character such that 


e a,(g) =a,(f) if p is not in X; 
e a,(9) = a,(f) if p is in Q; 
e a,(g) = 0 otherwise. 


The reduction g is the unique normalized eigenform in S2(T, k’) of trivial char- 
acter such that 


e a,(g) = tr Pr, (Frob p) if p is not in X; 
e a,(g) = ap if p is in Q; 
e a,(g) = 0 otherwise. 


Thus g has coefficients in k and is independent of the choice of f in Ng. We 
let m denote the corresponding maximal ideal of To. 

Suppose for the moment that K contains the coefficients of all eigenforms 
of level dividing Ny. Let NV’ denote the set of newforms g in Np such that 


© Pg ~ P; 
© a(g) = ap for all p dividing N,/Nop. 


Suppose we are given a newform g € N’. Let py denote its character and 
write @, for the conductor of Yg. Note that g has trivial reduction and 
hence ¢-power order, and that Q, divides Q. By proposition 3.2 we see that 
if p E€ & — Q, then p, is unramified at p and hence N, is not divisible by 
p. Furthermore, by theorem 3.1 (e) and (4.2.2), Ng = NpQ,. Let € denote 
the character of (Z/Q,Z)* of l-power order such that €>7 is the primitive 
character associated to w,. Then 


9 @ E = >. Eg(n)an(9)q” 
is in Na, C No. 


Lemma 4.9 The map g> g Q & defines a bijection between N’ and No. 
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Proof: Suppose we are given a newform f in Mg; i.e., f isin Np,(qgy and py ~ p. 
For q € Q, we have (by lemma 2.44 for example) that 


Eta 0 ) 
~ =i r 
PElG ( 0 eé 


where € 74 : Gq > K* is a character whose reduction is the unramified charac- 
ter sending Frob , to ag. Note that the characters £s q|7, have ¢-power order. In 
particular N;/No is Q4 where Qz is the product of the primes q € Q such that 
Eşq is ramified (theorem 3.1 (d)). Note also that there is a unique character 


EF i (Z/Q;Z)* mA K* 
such that 
I, + Go — Gal (Q (Co,)/O) È > K” 


coincides with the restriction of €f to I, for each q|Q;. (We have written £p 
for the character of Gg as well as the corresponding Dirichlet character. We 
shall also write €; for the corresponding character of Ag.) Let g denote the 
newform associated to the eigenform 


N E (n)an fq” € S2(Fo(No) VT1(Q3), K). 
By proposition 2.6 and theorem 3.1 we have 
© p~ ppO&, 
oy =, 
e N, = NoQ; and 


e a,(g) = (Esla Esa) (Frob q) has reduction ¿p ¿(Frob 4) = aq for all q divid- 
ing Qg. 


Therefore g is in N” and f = 9 @&,. In particular g > g @€, is surjective and 
Eg = gog,- Injectivity follows as well on noting that if g @ €, = g' Q Ey then 
E; = £y, hence g = g’. 

Recall that To contains the operators (d) for d in (Z/MZ)*, and note that 
(d) — 1 €m for all d. Let H’ denote the kernel of 


(Z/MZ)* = (Z/(][ 92)” > Ao. 
qEQ 


116 


Since the order of H’ is not divisible by £, we find that J 4cp (d) ¢ m. It 
follows that if d € H’ then (d) = 1 in Tm. We may therefore regard Tm 
as an O[Ag]-algebra via the map d — (d). Recall that Tg is considered an 
O[Ag]-algebra via the map 


O[Ag| ==? Ro =: Tg. 
If p is a prime not in Q, let x, denote the unique element of Ag such that 
~2 = 
r, =P. 


Proposition 4.10 There is an isomorphism To —> Tm of O[Ag]-algebras 
such that Tp œ x,T, for all primes p ¢ & with p4 4N (p). 


Proof: We may enlarge K so that we are in the setting of lemma 4.9. We then 
define an isomorphism of K-algebras 


k: Tna 8 K > I] K 
geN' 
such that for each g € M: 
e k(Tp)g = 4p(g) if p is not in X or if p divides Q,, 
e «K(T;,), = 0 if pis in U —Q, 


e «(T))q is the root of X? — a,(g) + Ya(p)p with reduction a, if p is in Q 
but does not divide Q,, and 


e «(d), = v,(d) for all d E€ Ag. 


The existence of such an isomorphism follows from lemma 4.4 which gives 


Tn K — I] I] (Ax g/Ikg)p; 


gJENT PEMg 


where M, is the set of prime ideals in Ax,g/Ir whose preimage in To is 
contained in m. If g is not in M”, then M, is empty. If g is in N’, then M, 
consists of the prime ideal corresponding to the eigenform whose eigenvalues 
are prescribed as above, and one checks that (Ax, g/Ixg)p = K. 


Viewing Tg as a subalgebra of || K and matching indices via the bijec- 
ENQ 
tion f = g8 £; e g, we obtain an injective homomorphism of O[Ag]-algebras 


such that 
K' (Ip) = (E5(p)ap(9))g = K(£pTp) 

for primes p ¢ © such that p { Ng. Since Tg is generated over O by the set 
of such Tp, we see that «’(Tg) is contained in «(Tm). On the other hand, the 
image of xK’ contains the image of Ag, hence contains «((d)) for d in (Z/MZ)”* 
as well as x(Tp) for p ¢ X with pt Noé. 

It remains to prove that x(Tp) is in the image of «/(Tg) for p dividing 
Nol Ikeo q. Consider the composite 


Ro T > J] K. 
gEN’ 


univ 


The pushforward of p°“ is equivalent to ] [py ® ég and therefore J [p is 
equivalent to the pushforward of 


pa" Q ég" 


(where €g was defined in section 2.8). For primes p dividing Ng, we recover 
k(Tp) as the image of Frob, on the J,-coinvariants, hence «(Tp) is in «'(Tọo). 
For q € Q and g € N’, the pushforward of Eala, -€,q to the g-component is an 
unramified summand of p|, whose reduction sends Frob, to ay. It follows 
that this character maps Frob , to «(T,) and we conclude that «(T,,) € «’(Ta). 
Finally in the case that € does not divide Ng, we appeal to lemma 4.1 to 
conclude that «(T?) € K'(Tọo). 


Auxiliary primes: For © as in proposition 4.10, i.e. a set of primes satisfying 
(4.2.2), a somewhat simpler argument provides a similar a description of Tg 
with I replaced by 


I’ =Lo(Ng) AP y(M) (4.2.5) 


where H is the ¢-Sylow subgroup of (Z/MZ)*. While we shall make no direct 
use of this, the group I” will play a role in the proof of theorem 3.31 and 
we shall need to choose © so that T” has no elliptic elements; i.e., non-trivial 
elements of finite order. This is the case for example if X contains a prime 
p £1mod £ with p > 3. The group I” is then contained in (NV) for some 


integer N > 3, and therefore has no elliptic elements. Indeed if $ t) eT 


has finite order then the roots of X? — (a + d)X +1 are roots of unity and we 
deduce that this matrix is the identity. 

In order to show that © can be so chosen, we appeal to the following lemma 
(cf. [DT2], lemma 3). 
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Lemma 4.11 Suppose that G is a finite group, x : G — k* is a character of 


order d and p: G — GL(k) is a representation. Suppose that for all g ¢ ker x 
we have 


(tr p(g))*/ det a(g) = (1 + x(9))"/x(9). (4.2.6) 
e Ifd>3, then p is reducible. 
e If d=83, then p is reducible or has projective image isomorphic to A4. 
e Ifd=2, then Pltery is reducible. 


Proof: Note that if p(g) is a scalar then x(G) = 1. Hence x induces induces 
a surjective homomorphism y’ : G’ — Ca where G” is the projective image of 
p and Ca is cyclic of order d. Furthermore if d = 2, then every element of 
G’ — ker y’ has order 2. The lemma then follows from theorem 2.47(b). 


4.3 Proof of theorem 3.31 


We shall give a proof of theorem 3.31 which is based on the g-expansion princi- 
ple rather than the method of de Shalit [dS] employed in [TW]. It will be more 
convenient to consider the action of the Hecke operators on the full space of 
modular forms Mə(T). The Riemann-Roch theorem shows that the dimension 
of Mə(T) is g+s—1 where g is the genus of the modular curve Xr associated 
to T and s is the number of cusps on Xp (see for example [Shi2] thm. 2.23 or 
[DI] (12.1.5)). 


Eisenstein maximal ideals: One can give an explicit description of a space 
of Eisenstein series G(T) so that 


M2(T) = S(T) 6 G(T). 


(See for example [Hi3], lemma 5.2.) 

There is a natural action on Mə(T) by the Hecke operators (d) for all d € 
(Z/NZ)* and T, for all primes p. We shall only need to use the operators (d) 
and T, for p not dividing N, and we let Tz denote the subring of End (M2(T)) 
these generate. The ring Tz is commutative and is finitely generated and free 
as a Z-module. That Tz is finitely generated is proved for example by showing 
that M2(T, Z), the set of forms with integer Fourier coefficients at oo, is stable 
under the Hecke operators and contains a basis for M2(T) (see [DI], cor. 12.3.12 
and prop. 12.4.1). Alternatively, one can show that Tz acts faithfully on the 
cohomology of the non-compact modular curve Yp (cf. section 1.3). 


119 


For any ring A, we write M2(T, A) for M2(T,Z) & A and regard this as a 
module for Ta = Ty & A. If n is a maximal ideal of Ts we write Th for the 
localization. If m is a maximal ideal of To then we let m denote its preimage 
in To: 

We say that a maximal ideal n of To is Eisenstein if 


T, =p+1modn forall p= 1 mod N. 


One sees from the explicit description of the Eisenstein series that Tp = p+1 on 
G(T) if p = 1 mod N. We shall be interested in the non-Eisenstein maximal 
ideals because of the following lemma (see [R5], thm. 5.2(c)). 


Lemma 4.12 Suppose that £ is odd. The representation 
Pm: Go =. GL2(To/m) 
is absolutely irreducible if and only if m is not Eisenstein. 


Proof: If pm is not absolutely irreducible then pm ~ ¥1®@xX2 with N divisible by 
the product of the conductors of x; and x2. Furthermore if Z does not divide 
N, then one of the characters is unramified at @ while the other coincides with 
the cyclotomic character € on J. Therefore m is Eisenstein. Conversely if m is 
Eisenstein then proposition 2.6 implies that pm restricted to Gey¢,,) has trivial 
semisimplification from which it follows that pm is also reducible. 


Differentials: Suppose now that X is a finite set of primes satisfying (4.2.2) 
Moreover we assume that X contains a prime p > 3 with p#1 mod £. Let T 
and I” be defined as in (4.2.4) and (4.2.5); i.e. 


DP = To( Ng) NP1(M) 
TY =To(Ng) OP atM) 


where H is the (Sylow subgroup of (Z/MZ)*. Let X and X’ denote the 
modular curves associated to F and T”. 

We first consider the case where £4 Ng. The curve X has a smooth proper 
model ¥ over Z[1/NoM] such that the complement of the cusps parametrizes 
cyclic isogenies (E1, i1) — (£2,%2) of degree Ng where E; is an elliptic curve 
and i; is an embedding um —> Ej. (See [DR] or [Kat] for example.) The action 
of H on X extends to X and the quotient X’ = X/H is a smooth model over 
Z|1/NgM| for X’. The natural projection ¥ — X’ is étale. (The fact that it is 
étale on the complement of the cusps follows from the natural moduli-theoretic 
description of X’ using the fact that Tra(M) c F(p) for some p > 3. One 
then need only verify that X — X’ is unramified at the cusps.) 

For a Z[1/NoM]-algebra A, we define 
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e Qa = H? (Xa, Iyaa) 
© y= E(X, Wy y), 
> Qa E H? (Xa, 2%, ;4(P)), 
e 04 = (44,8, (D), 


where D (resp. D’) denotes the reduced divisor defined by the cusps of 4 (resp. 
x’). The q-expansion principle and standard base-change arguments allow us 
to identify these with S2(T, A), S2(T', A), M2(T, A) and M2(I’, A) (see [Kat], 
sec. 1.6, 1.7). We may therefore regard the first two of these as modules for 
T4 and all of them as modules for T4. 


Lemma 4.13 Suppose that A=k or K. 

(a) Qo @o AZ Dy and Oo, Go ALM, 

(b) The natural map X — X' induces an isomorphism 

m 5> 0E. 
(c) If n is a non-Eisenstein maximal ideal of To, then the localization at n 
of Qo > Qo is an isomorphism. 

(d) If m is a maximal ideal of T4 then Qa|m] is one-dimensional over T4/m. 

Sketch of proof: 


(a) In the case A = K this follows from the fact that K is flat over O, so 
suppose A = k. Identify Qo/AQo with the direct image of Q; under 
Xk > Xo and use that H*(%,, Q% ,(D)) vanishes by Serre duality. The 


case of (' is similar. (See [Mazi] sec. II.3.) 


(b) Using the fact that ¥ — 2X’ is étale one identifies the pull-back of Qe JA 
with esa and that of D’ with D. 


(c) The map is injective and one proves that its cokernel is free and annihi- 
lated by the operators T, — (p+ 1) for all p = 1 mod NoM. To prove the 
latter assertion, first observe that it holds with O replaced by C, then 
by Z[1/NoM]. 
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(d) This follows from 
Qaim] S S(T, A)[m] = Hom (Ty, A)[m] = Hom,(T4/m, A), 


where the middle isomorphism is that of proposition 1.34, but let us 
reformulate the argument in a way more easily generalized to the case 
of £ dividing Ng discussed below. Note that Q,4[m] is non-zero since 
Tm acts faithfully on Qm. To prove that the dimension is at most one 
we can enlarge the field A and assume T4/m = A. One then shows 
that an eigenform f in Q4 (for all the Hecke operators T, and (d)) is 
determined by its eigenvalues and the first coefficient of its g-expansion. 
This follows from the fact that f is determined by its g-expansion and 
for all n, an(f) = ai(Ta f) and T, can be expressed in terms of the T, 
and (d). (See [Maz1], sec. II.9.) o 


We now explain how the situation changes if 4 divides Ng. In that case 
X has a regular model ¥ over Z[¢/NgM] with the same moduli-theoretic de- 
scription as above. This model is smooth over Z[1/NọM], but ær, has two 
smooth irreducible components crossing at ordinary double points as in [DR] 
sec. V.1. The quotient X’ = X/H is a regular model for X’ over Z[¢/NoM] 
and ¥ — X’ is étale. For a Z[@/NgM|]-algebra A we define Q4, 04, Qa and 
Q4 as before, except that Q! is replaced by the sheaf of regular differentials 
(see [DR] sec. 1.2, [Maz1] sec. II.6 or [MRi] sec. 7). Formation of these mod- 
ules again commutes with change of the base A, but we can no longer identify 
them with S2(T, A), S2(T', A), Mə(T, A) and M2(T', A) if Z is not invertible 
in A. For every A, there is a natural action of T4 (resp. T4) on Q4 and 0, 
(resp. Q4 and (,). In the case of Ty this is proved by identifying 94 (resp. 
Q',) with the cotangent space at the origin for the Néron model over A for 
the Jacobian of X (resp. X’). In the case of T4 the action is defined using 
Grothendieck-Serre duality (as in [Maz1] sec. II.6 or [MRi] sec. 7). 


Lemma 4.14 If £ divides Ng then lemma 4.13 carries over with the above 
notation and the additional hypothesis T; € m for part (d). 


The proof is essentially the same except that part (d) is more delicate in the 
case A =k. For the proof in that case we refer the reader to [W3], lemma 2.2 
(from which the result stated here is immediate). We remark however that we 
shall only use lemma 4.14 if pm is not good. In that case one can also deduce 
the statement here from the argument used in the proof of [MRi] prop. 20, 
which shows instead that dim, Q,|m’] = 1 where n is a certain maximal ideal 
corresponding to m, but contained in an algebra defined using the operators 
(d), T, for p # £ and wy. 
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Remark 4.15 The approach in [TW] to proving theorem 3.31 is based on the 
result that under certain hypotheses (7e(J) ®z, O)m is free of rank two over 
Tm (see [W3], thm. 2.1 and its corollaries). This result generalizes work of 
Mazur Ribet, and Edixhoven (see secs. 14 and 15 of ch. II of [Maz1], thm. 5.2 
of [R5], [MRi] and sec. 9 of [Edi]), and the key to its proof is lemmas 4.13(d) 
and 4.14. We shall instead give a proof of theorem 3.31 which is based directly 
on lemma 4.13, (d). 

The fact that (Te(J) 8z, O)m is free of rank two over Tm actually underlies 
Wiles’ approach to many intermediate results along the way to proving the 
Shimura-Taniyama conjecture for semistable elliptic curves. We shall appeal 
to a special case in the course of proving theorem 3.36 below. 


Proof of the theorem: Suppose we are given a representation p as in sec- 
tion 3.3 and a set of primes Q as in the statement of theorem 3.31. We apply 
lemma 4.11 with G = Gg and x = e to choose an auxiliary prime p not dividing 
6NolT [eq 4 such that 


e p#1modl 


e tr (p(Frob ,))? # (p+ 1). 


To prove theorem 3.31 we may replace K by a larger field and assume that k 
contains the eigenvalues of p(Frob p). 

Let © = QU {p}. Thus © is a set of primes as in proposition 4.10 and £ 
contains a prime p > 3 such that p # 1 mod @. We let T be as in (4.2.4); thus 


P = oWo) N Ti(M) 


where M = p’ JJ aco 1 and we choose the maximal ideal m of To as in propo- 
sition 4.10. Let H denote the 4-Sylow subgroup of (Z/MZ)* and regard To 
and hence Tm as an O|H]-algebra via d +> (d). In view of proposition 4.10, 
theorem 3.31 is equivalent to the following: 


Theorem 4.16 Tm is free over O[H]. 
Proof: Consider the To-module 
LE = Homa (Qo, O) 


and the To-module 
L= Homo (Qo, O). 
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Since X — X’ is unramified, the Riemann-Hurwitz formula implies that 
dim M2(T) =g+s—-1=#H(g'+s'-1)=#Hdim™,(I’), 


where g (resp. g’) is the genus and s (resp. s’) is the number of cusps of X 
(resp. X’). It follows from lemma 4.13 (a) (and lemma 4.14) that 


dimg(L 80 K) = #H dim, 0%, (4.3.1) 
On the other hand, by (b) we have that 
L/(a,\)L S Hom, (Q#, k) = Hom, (M,, k) (4.3.2) 


where a = ker (O|H] — O) is the augmentation ideal. By Nakayama’s lemma 
and (4.3.2), L is generated as an O|H]-module by d elements where d = 
dim, 9). By (4.3.1) any surjective homomorphism 


OJH = L 
is in fact an isomorphism. Hence L is free over O[H], as is 
Ln = Homo (Qa, O) & Homo (Qn, O) 


for each non-Eisenstein maximal ideal n of To (by part (c) of lemmas 4.13 
and 4.14). 

Since pm ~ J is absolutely irreducible, m is not Eisenstein (by lemma 4.12), 
and it follows that Lm is free over O[H]. 

Since L &o K is free of rank one over Tx (lemma 1.34), we have that 
Lm Qo K is free of rank one over Tm 80 K. If £f Ng then lemma 4.13 implies 
that 


is one-dimensional over k. The same assertion holds if 4|Nọ by lemma 4.14 
(from the definition of m in this case, we see that it does not contain T). It 
follows from Nakayama’s lemma that Lm is free of rank one over Tm. Therefore 
Tm is free over O[H]. 


4.4 Proof of theorem 3.36 


Our proof of theorem 3.36 is based on Wiles’ arguments in ch. 2 of [W3], 
which in turn are based on a method of Ribet [R4]. We shall reformulate the 
proof somewhat to underscore the relationship observed by Doi and Hida [Hil] 
between the size of O/7 and the value of an L-function, but after doing so we 
shall also sketch the more direct argument used by Wiles. Two important 
ingredients appear in both versions of the argument, and we shall discuss their 
proofs in section 4.5. These ingredients are 


124 


e the generalization of a result of Ihara [Ih] used in Ribet’s argument; 


e the generalization of a result of Mazur [Maz1] on the structure of the 
Tate module T(Jo(N)) as a module for Tz,. 


Assume now that we are in the setting of theorem 3.36. In particular, J is 
a representation as in section 3.3, X is a finite set of primes contained in X5 
and 7 is an O-algebra homomorphism Ty — O arising from a newform f in 
Ns with coefficients in O. 


The symmetric square L-function: For each prime p let a,(f) and (f) 
denote the roots of the polynomial X? — a,(f)X +6,p = 0, where 6, =0 or 1 
according to whether p divides Ny. Thus we have 


Ly(f,8) = (1 — ap (fp) = Blf). 


If p = £ divides N/N;, then we require that a¢(f) be the root which is a unit 
in Ø, i.e. ae(g). We define the symmetric square L-function associated to f as 


L(Symm’ f, s) = I] L,(Symm * f, s), 
p 


where 
Lp(Symm’ f, 8) = (1 — a3 (f)p-*) "(1 — al PBPT *) "(1 — B). 


(We caution the reader we have defined the Euler factors rather naively at 
primes p such that p* divides Ny.) The product converges absolutely for real 
part of s > 2 and can be analytically continued to an entire function. 


The calculation of ns: We write p for the kernel of 7 and J for the annihilator 
of p in Ty. Since Ty is reduced, we have oN J = 0 and p @ I has finite index 
in Ty. Note that 

O/n> = Ts/(p T). 
Suppose that we are given a Ty-module L which is finitely generated and free 
over O, and such that L & K is free of rank d over Ty &o K. Suppose also 
that L is endowed with a perfect O-bilinear pairing 


LxL— O 
(x,y) > (x,y) 
such that (Tz, y} = (x, Ty} for all z,y € L and T € Ty. Note that to give 
such a pairing is equivalent to giving an isomorphism 
L — Homo(L, O) 


zr => (z, 
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of Ty-modules. We shall refer to the module L together with the pairing ( , ) 
as a self-dual Ty-module of rank d. We then can give a lower bound for the 
size of O/ny, in terms of a basis {21,22,... , Za} for the free O-module L]g]}: 


Lemma 4.17 We have 
ns C O det( (xi, 5) aj, 
and equality holds if L is free over Ty. 


Proof. The modules L[{g] and L/L|I] are free of rank d over O and the pairing 
( , ) induces an isomorphism 


LILII] ~+ Homo(L{p), 0). (4.4.1) 


The O-module M = L/(L[p] @ L[I]) is annihilated by ny and is generated by 
d elements. Furthermore M is isomorphic to (O/ns)¢ if L is free over Ty. It 
follows that 

#(O/ns)* > #M, 
with equality if L is free. The cardinality of M is that of the cokernel of the 


map 
Lip] Homo (L[p], O) 


arising from the pairing ( , ), and this is precisely 


O/ det((xi, x;))i 7. 


Recall that proposition 4.7 establishes an isomorphism between Ty and 
Tm, the localization at a certain maximal ideal m of To = Tz & O, where 


Tz C End (S2(To(N))), 


and where N = Ny is defined in (4.2.1). Write X for Xo(N) and J for Jo(N). 
Recall that H,(X,Z) is endowed with the structure of a Tz-module and 


Ti(J) @z, O = Hy(X,Z) 8 O 
with that of a To-module. We also regard 
H'(X,O) S Hom(H,(X,Z), O) 
as a To-module; as such it is naturally isomorphic to 


Homo(Te(J) 8z, O,O). 
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The Weil pairing defines a perfect pairing 
T(J) x T(J) > Ze 


(with (e?7’/“"),, as the chosen generator for lim pe» (C)). The composite 


T(J) “3 T(J) > Homz,(Te(J), Ze) (4.4.2) 


is an isomorphism of Tz,-modules where w, is induced by the involution w = 
wy of X (see section 1.4). Tensoring with O and localizing at m, we regard 


Lr = (Zo( J) QZ O)m 


as a self-dual Ty-module of rank 2 via the isomorphism Ty = Tm of proposi- 
tion 4.7. We shall write (,)z for the pairing obtained from (4.4.2). Similarly, 
using w*, the cup product and Poincaré duality, we regard 


Ly = H'(X, O)m 


as a self-dual Ty-module of rank 2, with (x,y) defined by the image of x U 
w*y under the canonical isomorphism of H?(X, O) with O. Note that Ly is 
canonically isomorphic as a Ty-module to Homo(L7, ©). In the next section 
we shall discuss the following generalization of a result of Mazur [Maz1]. 


Theorem 4.18 The Ts-modules Lr and Ly are free. 


Corollary 4.19 If {x,y} is a basis for Lrg] (resp. Ly|o]), then ny is gen- 
erated by (x,y)r (resp. (x,y) a). 


This follows from lemma 4.17, theorem 4.18 and skew-symmetry of the pair- 
ings. 
Hida’s formula: We now explain how the value of (x,y) in corollary 4.19 
is related to L(Symm’ f, 2) by a formula of Hida (see [Hil] sec. 5, [W3] sec. 
4.1). 

Recall that the homomorphism 


To > Ty = TsO 


arises as Tn ++ a,(g) for a normalized eigenform g in S2(To(N), K) whose as- 
sociated newform is f. We shall write P for the kernel of this homomorphism; 
thus P is the preimage in To of the ideal p of Ty. Note that Pm corresponds 
to p under the isomorphism Tm = Ty. 
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Choose a number field Ko containing the Fourier coefficients of g and let 
Oo denote the valuation ring OM Ko. We choose the basis {x, y} for Lylo] in 
the image of 


H"(X,Oo)[Po] C H'(X, O)[P] — H'(X, O)m[P] = Lu lel, 
where Po = P N To,. Let Pc denote the kernel of the homomorphism 
0, : Tc —> C 
associated to the eigenform g. The two-dimensional complex vector space 
(H! (X, Oo) [Po]) 80, C = H'(X, C) [Pe] 


has a canonical basis 

{Wg, Dye}, 
where we view these differential forms as cohomology classes and the super- 
script c indicates complex conjugation applied to the Fourier coefficients. (Re- 
call that wg is the holomorphic differential on X defined by S*a,,(g)q" ‘dq, 
SO Wge is the antiholomorphic differential defined by >> a,(g)q"~'dg where 
q = e?""".) Let A denote the matrix in GL2(C) such that 


(Wg, Wge) = (x, y)A. 
Theorem 4.20 With the above notation we have (up to sign) 
(a, y) = (ix det A)“'N,L(Symm?f,2) || (P), 
p|N/N¢ 


where 
E tna 
(ael f) — Blf) -ar (f)) if p = L 


Proof: We have 
(x,y) det A = f Wg A Dirge = 8Sr’ilg, (wg)°) 
x 


where the last (,) denotes the Petersson inner product and we have used that 
wg? = (wg)° for forms on To( N). We then appeal to a formula of Shimura 
(see [Shi5] (2.5) and [Hil] (5.13)) to obtain 


(g, (wg)*) = (487) "[SLo(Z) : To(N)]ress=2D(g, wg, 8). 
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By [Shi5] lemma 1, the Dirichlet series D(g, h, s) is defined by ` an(g)an(h)n* 
and if g and h are normalized eigenforms then this has an Euler product 
expression in which the factors are 


(1 — ap(9)Bp(9)ap(h)Bp(h)p~**) (1 — ap(g)ap(h)p~*)- 
(1 = apl g)bp(h)p =) (1 z Bplg)aplh) p) "(1 T Bpl9)Bo(h)p™:) +, 


(where the a,’s and 8p’s are defined as they were for f). Using the recipe for 
obtaining g from f (see lemma 4.6) we find that if £ is not in © then 


D(g,wg,s) =+Ds(f, f,s) [| 2>, 
pIN/Nf 


(where the subscript X indicates that the primes in X are removed from the 
defining Euler product). If / is in X, then we must also multiply by the factor 


o (1— £5)! if £ divides Ny; 
© (1-0) (1 — alf BS) — ag? (S)) otherwise. 


To obtain the formula in this last case, consider the eigenform 


fa = f a Bel f) f(e7) 
on T'o(Nr@). Since 
Wnyefa = (Lf (Er) — ag (f) f) 

and 

D( fa, Lf (T), s) = al f) D( fa, Í, s), 
we get 

D( fa, WN pefo, s) =< +6" (Be(f) ra ael FJC) D, fa, J; 5) 
and use the Euler product. The Euler product expression also gives 
_ C&n(s— 1)Ly(Symm? f, s) 


Ds f,4,8) = Cn (2s — 2) ’ 


where Cy(s) is the Riemann zeta function with the Euler factors removed at 
primes dividing N. Thus 


6 NLy(Symm’f, 2) Twi, P 
T? [SL(Z): T(N) 


with the extra factor of (1 — @~*)~! or 
(A Sas na=or OP) dat) 


if Z is in X. The theorem follows. oO 


res,-2D(g,wg,s) = + 
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Corollary 4.21 With the above notation (in particular, © C 5), we have 


(™ N;Ly,(Symm ”f, 2) 


in det A ence 


is a generator of ng, where 


0, ifl’, 
m= { 2,if lE X and €|Ny, 
3, if LEE and L{ Ny. 
Remark 4.22 Without theorem 4.18 we obtain instead that ny is contained 
in the ideal generated by the expression in the corollary. 


Comparing ny’s: Suppose that X’ C X; is a finite set of primes containing X. 
In view of corollary 4.21, theorem 3.36 reduces to a comparison of det A and 
det A’ where A’ is defined using X’ instead of X. We shall make the desired 
comparison using Wiles’ generalization of a result of Ihara. 

Applying proposition 4.7 to X’, we have an isomorphism between Ty and 
Ti where mw is a certain maximal ideal of Tp = T, & O with TZ acting on 
So(To(Nx)). We write P’ for the preimage in Th of g'. We let N’ = Ny, 
X’ = Xo( N’), J’ = Jo(N’) and define self-dual T’ y-modules L4} and L'y as 
above. 

We now define a homomorphism 


or : Lz > Lr 


of T’ -modules in the case that &’ = XU{p} for some prime p € X5—¥}. We first 
suppose that p 4 £, in which case N’ = Np? and the surjective homomorphism 
of O-algebras T'y > Tm (corresponding to Ty, + Ty) is defined by T, + T, 
ifr Æ p and T, + 0. The maps T +> T, T |> pr and T |> p*7 on the upper 
half-plane induce maps X’ — X which we denote a, @ and y. These give rise 


by functoriality to a map 


Is JxJxJ 
LE (AL, BaL, YaTT) 


which induces an O-linear homomorphism 
TÁJ) 8z O > (Til J) Gz, OF. 
We compose with the projection to 


Ly = (L(I) 9z, O); 


m 
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followed by the map L} — Ly defined by the matrix (1,—p~'T,,p~'). The 
reason for this choice of matrix is that 


g = (a* — p'e Tp + py") g (4.4.3) 
is the eigenform corresponding to P’. 
Lemma 4.23 The composite 
(J) ®z, O > Lr (4.4.4) 


is Tl,-linear where Ly is regarded as a T!,-module via the surjection 
O O J 


To zs are Wes 


m’ 


We leave the proof to the reader after pointing out that the map on Hecke 
algebras can be rewritten as 


T a Tolul/(f(u)) > Tmlul/(f(u)) > Tm 


A u u => 0, 


where f(u) = u? — Tpu? + pu. 
It follows that (4.4.4) factors through L and we thus obtain the desired 
homomorphism 
or: Ly > Lr 


of T/,-modules. Note that ¢7 is also the composite of the canonical splitting 
Ly = T(J’) Ze O 


with (4.4.4). 

The construction of @7 is similar for p = £ except that we have only two 
copies of J and the map L2 — Lzy is given by the matrix (1, -a7') where ay is 
the unit root of (3.3.2). For arbitrary X’ satisfying & C X’ C X; we define or 
as a composite of the maps defined above. It is independent of the choice of 
ordering of ©’ — X. Recall that ¢> is used to denote the dual map Lz > L7. 
We define dy : Li; — Ly as the adjoint of the map Øy which renders the 
diagram 

by : HHX, O)m E HECK, O) ra! 


Home itr, O) — Homo tn O) 


commutative (where the vertical maps are the natural isomorphisms and the 
bottom one is dual to @7). 

The second crucial result whose discussion we postpone until the next 
section is the following generalization of a lemma of Ihara. 
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Lemma 4.24 ¢7 and @y are surjective. 


Suppose again we are in the case ©)’ = X U {p} with p 4 l. One need only 
unravel the definition of ġ'y to see that the diagram 


A(X, me [Po] E ae) [Po] 
i G Ie 
commutes, where the top arrow is defined as 
a* — p`r(Tp) 6" + pty”. 


Extending scalars to C, this map sends wg to wy and Wye to Wye where g' is 
defined by (4.4.3). In the case X’ = X U {7}, the same assertion holds if Oo is 
chosen so that it contains T(ap) = a¢(g’) and the top arrow is defined as 


a* — n(ae) 6. 


We conclude that if {x,y} is a basis for H'(X, Oo)[Po] and the matrices A 
and A’ are defined using the bases {x,y} for H!(X, Oo)[Po] and {d),7, o,y} 
for H!(X', Oo)[P}], then A’ = A. Theorem 3.36 is now immediate from corol- 
lary 4.21. In fact, we have proved that 


Nyy = T( I] Cp). 


per’ -X 


Remark 4.25 Note that to obtain the inclusion stated in theorem 3.36 it suf 
fices to apply theorem 4.18 for U, rather than both © and X’ (cf. remark 4.22). 


Wiles’ argument: The method of [W3] sec. 2.2, like that of Ribet in [R4], 
is more direct than the one given above but does not explicitly illustrate the 
relation with values of L-functions. The approach is simply to compute the 
composite @7¢',. It suffices to consider the case X’ = X U {p} and we suppose 
first that p # £l. Let A denote the endomorphism of J? defined by the matrix 


Ox 
Am (2:) wh(at, B, yw. 


Using the relations aw’ = wy and Bw! = wi, we find that 
AY“ a,3* a0 
A= (27 bp” i) ) 
YY" WE Yor 
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which can be computed for example by considering its effect on the cotangent 
space S2(To(N))?. The result is that 


T? —(p+1) pT, pip +1) 
A= pT p(p + 1) : pT 
p(p + 1) pIp te pay) 


which we note commutes with the action of Tz on J?. One then checks that 


1 
borer = (1, —p "Tp, p~') (L(A) Ze O)m (7) 
P 
= —p*(p — 1)(T? — (p+ 1)”). 


The case of p = £ is similar but simpler. One uses 


= Ox / * ok = T, £ + 1 
= (G5) wi(a BW = ai T; ) 
and gets 
bro = —(a — fay") (1 — az’). 
Thus in either case we find that ¢r@> acts on L4[P] by an element of O = 


Tm/P which is a unit times a(c,). Theorem 3.36 then follows from corol- 
lary 4.19 and lemma 4.24. 


4.5 Homological results 


In this section we sketch the proofs of theorem 4.18 and lemma 4.24, but shall 
often refer the reader to ch. 2 of [W3] for more details. 


Multiplicities: We first consider theorem 4.18, generalizing a result proved 
by Mazur in sections I.14 and II.15 of [Mazl]. Recall that Lr (resp. Ly) 
is defined as (T(Jo(N))@o0)m (resp. H!(Xo(N), O)m) where m is a maximal 
of the ideal of the Hecke algebra To generated by the Hecke operators on 
Sə(To(N)). We are assuming moreover that pm is irreducible and that one of 
the following holds: 


e ¢ does not divide N; 
e l? does not divide N and T, ¢ m. 


We wish to prove that Ly and Ly are free over Tm. 
Since Ly and Ly are isomorphic as Tm-modules, it suffices to prove that 
Lr is free. Furthermore, to prove that Ly is free, it suffices to prove that the 
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localization of Tp(Jo(NV)) at m N Tz, is free; i.e., we may replace O by Z; and 
m by its intersection with Tz, and consider 


Ti(Jo(N))m- 


Since Te(Jo(N)) 8z, Q is free is free of rank two over Tg,, it suffices to prove 
that 
dimp/m(Lr/mL7) — 


Since 


Te(Jo(N))/€Te(Jo(N)) = JN) = Home, (Jo(N) [4], Fe) 


as Ty,-modules, it suffices to prove the following cases of [W3] thm. 2.1. 


Theorem 4.26 Let m be a maximal ideal of Tz,. Suppose that pm is irre- 
ducible and that either 


(a) £ does not divide N, or 
b) CIN and Ty ¢ m. 


Then 
dimy,, /mJo(N)[m] = dimp,, /m(Jo(N)[é]/m) = 2. 


The proof of theorem 4.26 in case (b) requires more of the theory of group 
schemes and Néron models than we wish to delve into here. We shall therefore 
only explain the proof in the case (a) (which is [R5] thm. 5.2(b), see Ribet’s 
paper for more details) and refer to [W3] sec. 2.1 for the general case. We 
remark however that the proof of theorem 3.42 appeals to theorem 3.36 only 
in the case © = Ø (cf. remarks 4.22 and remark 4.25). Recall that £ divides No 
only if p is not good. So for the purposes of proving the Shimura-Taniyama 
conjecture for semistable elliptic curves, (b) can be replaced by the stronger 
hypothesis 


(b') £24 N and pmlc, is not good. 


The result in this case is due to Mazur and Ribet (the main result of [MRi]) 
and the proof is slightly easier than in the case of (b). 

Returning to case (a), we appeal to a general property of the functor D of 
theorem 2.31 which follows from cor. 5.11 of [Oda]. 


Theorem 4.27 Suppose that A is an abelian variety over Ze (i.e., the Néron 
model of an abelian variety over Q; with good reduction). There is a canonical 
and functorial isomorphism of vector spaces over Fe 


D(A(Q,)[é]*)° = Cot 9( A/F), 


where Cot 9 denotes the cotangent space at the origin. 
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In the case that A is the Jacobian of a smooth proper curve X over Z,, we 
have also a canonical isomorphism 


Cot o(A/ Fe) = P(X, QR) 


and in the case X = X,(N) with £ { N, this provides an isomorphism of 
Ty,-modules 


D(Jo(N) (Qe) [E]*)° = S2(To(N), Fe). (4.5.1) 


Consider now the action of Gg on the points of 
V = (N) im] = (NEm). 


By the argument in [Maz1] prop. II.14.2 or by the main result of [BLR], we 
know that every Jordan-Holder constituent of the representation of Go on this 
T/m-vector space is isomorphic to p%, = pm. It follows that 


dimg/mD(V) = 2dimr/mD(V)° 


where we now regard V as a good Gy-module. On the other hand (4.5.1) 
implies that 


(V)? = Sa(To(N), Fe)[m], 


which is one-dimensional over T/m by the g-expansion principle (lemma 1.34 
for example). We have now shown that D(V) is two-dimensional over T/m, 
and it follows that so is V. 


Ihara’s lemma: We now sketch the proof of lemma 4.24, which proceeds 
by analyzing the behavior of the homology of modular curves under certain 
degeneracy maps. Note that it suffices to prove the lemma for @-7. 

If N and M are positive integers, then we let [)(N, M) =Ti(N) A1o(M) 
and write Yı(N, M) (resp. Xi(.N,M)) for the associated non-compactified 
(resp. compactified) modular curve. The key intermediate result is the fol- 
lowing: 


Lemma 4.28 Suppose that N is a positive integer and p is a prime not di- 


viding N. 


(a) The map 
Ay(X1(N, p), Z) E ad 


is surjective, where a is defined by T =œ T and B by T = pr. 
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(b) If N >3 and £ is an odd prime different from p, then the sequence 


AL(Yi(Np, p°), Ze) > HM(Yi(Np), Ze)? —> Hy (¥i(N), Ze) 
z > (arx, Bit); (Y,2) = Boxy — azz 


is exact, where the maps a, and a2 are defined by T — T and Bı and pə 
by T= pr. 


To prove the lemma, one first translates the statement into one about the 
homology (or cohomology) of the congruence subgroups involved. Part (a) 
of the lemma is due to Ihara [Ih], but see also the proof of thm. 4.1 of [R4]. 
Part (b) due to Wiles is more elementary and is established in the course of 
proving lemma 2.5 of [W3] (see the sequence (2.13)). The result stated there 
is in terms of cohomology rather than homology, but the one given here is 
immediate from it. 


Remark 4.29 A method of Khare [Kh], sec. 2 using modular symbols yields 
an alternate proof of part (b) (after localization at a non-Eisenstein maximal 
ideal). 


To deduce lemma 4.24 from lemma 4.28 we also need the following result: 


Lemma 4.30 Suppose that O is the ring of integers of a finite extension of Qe. 
Let M be a positive integer and H a subgroup of (Z/MZ)*. Let To = Tz & O 
where Tz, is the subring of endomorphisms of M>(Tı(M)) generated by the 
operators (d) and T, for primes r{ M. Ifn is a non-Eisenstein maximal ideal 
then the localization at n of the natural map 


(a) Hy(¥i\(M),O) —> Hı(Xı(M), O) is an isomorphism; 
(b) Hı(Xı(M), O) — Hı(Xy(M), O) is surjective. 
To prove part (a), one checks that 
H (Yı(M), Z) — H (Xı(M), Z) 


is surjective and that T, = r + 1 on the kernel for primes r = 1 mod M (cf. 
proposition 4.13, part (c)). 

Part (b) is most easily proved by showing that Gg acts trivially on the 
cokernel of the natural map 


T(J (M)) > Te(Ju(M)) 


(using [LO], prop. 6 for example) and then applying lemma 4.12. 


136 


Finally we explain how to use lemmas 4.28 and 4.30 to prove lemma 4.24. 
It suffices to consider the case ©’ = NU {p}. We discuss only the more difficult 
case of p # l. We define N = Ny, N’ = Np?, m and wm as in the preceding 
section. Thus w is a maximal ideal of TG where Ty acts on S2(To(N”)). All the 
maps we consider in the argument below will respect the action of the Hecke 
operators in T% where TY is the subring of endomorphisms of M2(I',(Np, p?)) 
generated by the operators (d) and T, for primes r not dividing Np. We let n 
denote the preimage of m’ in T% = TX @ O. We first apply lemma 4.28(b) to 
show that 

Hi (Yi (Np, p*),O)n > Hi (Yi (Np), O)a > Mi(Vi(N), O)n 


n 


is exact. Next we apply lemma 4.30(a) to obtain the exactness of 


Hı(Xı (Np, p’), O)n > H(X (Np), O)? > H (X(N), O)n. 


n 


Applying lemma 4.30(b) with M = Np, we find that 
Hi (X(N, p°), O)a > Hi (X(N, p), O)a > H(X (N), O)n 
is exact. We then apply lemma 4.28(a) to conclude that the map 


H (Xı(N, p°), O)n = Hı(Xı(N), O) 
x b> (A,X, BL, Yat 


is surjective where a, 3 and y are defined respectively by T +> T, T+ pt and 
Tt p’r. Finally, we use lemma 4.30(b) again (now with M = N) to get the 
surjectivity of 

Hi (Xo(Np*), O)n > Ai (X(N), O) 


n? 


hence that of (4.4.4) upon localizing at w’. 

5 Commutative algebra 

In this section we collect some basic facts of commutative algebra that are 
used in the proof. We recall that O is the ring of integers in a finite extension 


K of Q, and that O has residue field k. Let Co denote as in section 2.6 the 
category of complete noetherian local O-algebras with residue field k. 
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5.1 Wiles’ numerical criterion 


In this section we state a numerical criterion discovered by Wiles for a map 
between two rings in Co to be an isomorphism. 


Complete intersections: We say that a ring A in Co is finite flat if it is 
finitely generated and torsion free as an O-module. A key ingredient in Wiles’ 
isomorphism criterion is played by the concept of complete intersection, for 
which we give the following naive definition. 


Definition 5.1 An object A in Co which is finite flat is called a complete 
intersection if it can be expressed as a quotient 


AOU co i E S 


where there are as many relations as there are variables. 


Remark 5.2 It is also true that if an object O[[Y1,... , Y,]]/J in Co which is 
finite flat is a complete intersection, then necessarily J can be generated by 
r elements. See for example [Mat], thm. 21.2 (and lemma 5.11 below). We 
will not use this fact in this chapter, although a special case is proved in the 
course of establishing lemma 5.30. 


The category C: The numerical criterion of Wiles is stated more naturally 
in terms of rings A in the category Co which are endowed with some extra 
structure: namely, a surjective O-algebra homomorphism 74 : A — O. Let 
Cg, be the category whose objects are pairs (A, ma), where A is an object of 
Co and T4 : A — O is a surjective O-algebra homomorphism, also called the 
augmentation map attached to A. Morphisms in C%, are local ring homomor- 
phisms which are compatible in the obvious way with the augmentation maps. 
By abuse of notation one often omits mentioning the augmentation map TA 
when talking of objects in C%, and simply uses A to denote (A, ma), when this 
causes no confusion. Objects of Cg will also be called augmented rings. 


The invariants ®, and 7,4: One associates to an augmented ring (A, ma) 
two basic invariants: 

®, = (ker 74)/(ker r4)’; 

na = Ta(Annyker Ta). 
Here Anna(J) denotes the annihilator ideal of the ideal J in A. 

The invariant ®, can be thought of as a tangent space for the object A. 

(More precisely, it is the cotangent space of the scheme spec(A) at the point 
ker 74.) It is a finitely generated O-module. 
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The invariant 74 seems less familiar at first sight. It is called the congruence 
ideal. (The reason for this terminology should become clearer shortly.) We 
are now ready to state Wiles’ numerical criterion: 


Theorem 5.3 Let ¢: R —> T be a surjective morphism of augmented rings. 
Assume that T is finitely generated and torsion-free as an O-module, and that 
nr # (0). (And hence, in particular, #:(O/nr) < co.) Then the following are 
equivalent: 


(a) The inequality #®pr < #(O/nr) is satisfied. 
(b) The equality #®p = #(O/nr) is satisfied. 


(c) The rings R and T are complete intersections, and the map ọ : R — T 
is an isomorphism. 


Remark 5.4 The above theorem is slightly different from the one that ap- 
pears in [W3], where it is assumed from the outset that the ring T is Goren- 
stein. In the form in which we state it above, the theorem is due to H. Lenstra 
[Len]. Our presentation follows Lenstra’s very closely. For the original (and 
slightly different) point of view, the reader should consult the appendix of 
[W3]. 


Some examples: Before going further, it may be good to pause and consider 
some examples of objects of Cg and the invariants associated to them. While 
logically independent of the proof, the examples should help the reader develop 
some intuition. (For a systematic way to compute the tangent spaces ® 4, see 
the paragraph at the end of section 5.2.) 


Example 1: 
A={(a,b)€OxO, a=b (mod \*)} 
~ OlT]|/(T(T — à”), with 74(a,b) := a. 
ax O/O, na =(A"). 
Example 2: 


A={(a,b,c)\€ OX OxO,a=b=c (mod A)} 
~ O|X, Y]]/(X(X — A), Y(Y — A), XY), with 7 4(a, b,c) := a. 


a ~ O/AOx O/O, — na = (A). 
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A= OXX), with zalf) = £0) 
Py = O, na = 0 
Example 4: 
a= fabodeOx xO a Te a) 
~ OX, Y]|/(X(X - A), Y (Y —A)), with Tala, b,c, d) := a. 


4 ~ O/AO x O/O, na = A] 
Example 5: 
A=O[[X,... , Xall, with m4(f) := f(0). 
a ~O", na = (0). 
Example 6: (A = l = —1 (mod 4), O=Z)). 
A= {(a,b+ ci) € Ze X Zili, a=b (mod £), c=0 (mod 4} 
~ lX X(x? + &)), with m4(a, b + ci) := a. 
a ~ Z/ËZ, na = (0°). 


Example 7: 
A=Ol[T]/AT) ~ O@kT OkT’ O---, with ma( f) = f(0). 
a xk, na = (A). 


5.2 Basic properties of 4 and 7,4 


In this section we collect some of the basic properties of the invariants ® 4 and 
na, and prove the equivalence of (a) and (b) in theorem 5.3. 


Behaviour of ®, under morphisms: The assignment A +> ®,y is a functor 
from the category Cô to the category of O-modules; a morphism A — B in 
C induces a homomorphism ®4 —> ®g of O-modules. Moreover, if A — B 
is surjective, then so is the induced map on the tangent spaces. Therefore, 
when A maps surjectively onto B we have 


#94 > #Oz. (5.2.1) 


There is also a converse to this, which will be useful later: 
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Lemma 5.5 If the homomorphism ®, —> ®p is surjective, then A —> B is 
also surjective. 


Proof: This follows from Nakayama’s lemma. (Cf. [Ha], ch. II, sec. 7.4 and 
[Mat], th. 8.4.) 


Behaviour of 74 under (surjective) morphisms: Unlike the assignment 
At ®y, the assignment A — 7, is not functorial, but it does have a nice 
behaviour under surjective morphisms: namely, if ¢ : A —> B is surjective, 
then 


na Cn, ie, #(O/nas) > #(O/nz). (5.2.2) 
This is simply because in that case ¢ induces a map 


Annyker 74 — Ann pgker Tp. 


Relation between the invariants 4 and 7,4: In general, we have the 
following inequality: 


#814 > #(O/na). (5.2.3) 


The key behind proving this identity is to interpret #®, in terms of Fitting 
ideals. 


Digression on Fitting ideals: If R is a ring (in Co, say) and M is a finitely 
generated R-module, we express M as a quotient of R” for some n: 


0 — M' — R" — M — 0. (5.2.4) 


The Fitting ideal of M, denoted Fitt R(M), is the ideal of R generated by the 
determinants det(v1,... , Un), where the vectors v; € R” range over all possible 
choices of elements of M’ C R”. One checks that this ideal does not depend on 
the choice of exact sequence (5.2.4), and hence is an invariant of the R-module 
M. For example, if M is a finitely generated O-module, we may write 


M = © @O/(A") 8 O/ (A?) @--- BO/(A*), 


with ny > n2 > --- > ng, and the sequence n; is completely determined by 
M. The Fitting ideal Fitt o(M) is then the ideal of O generated by A1 t Tt", 
if r = 0, and is the O-ideal if r > 0. Note in particular that, if M is a finite 
O-module, then 


#M = #(O/Fitt o(M)). (5.2.5) 
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Furthermore, if M is any R-module, it follows directly from the definition that 

Fitt (M) C Anng(M). (5.2.6) 

Fitting ideals behave well under tensor products: in particular, if M is a 
finitely generated A-module, where A is an object in Cg, then: 

ta(Fitt a(M)) = Fitt o(M 84 O), (5.2.7) 


where the tensor product is taken with respect to the augmentation map 74. 
For more details and references on the Fitting ideal, see [Len] for example. 


Now we are ready to prove equation (5.2.3). For, noting that ®, = 
ker 74®,40, where the tensor product is taken with respect to 74, and applying 
equation (5.2.7) with M = ker 74, we have: 


Fitt o(®,) = na(Fitt A(ker Ta)) C mwa(Ann ker 74) = "A; 
(5.2.8) 


where the containment follows from equation (5.2.6). Now the inequality 
(5.2.3) follows by combining (5.2.5) and (5.2.8). 

As a consequence, we have 
Corollary 5.6 The statements (a) and (b) in theorem 5.8 are equivalent. 


Proof. If R —> T is a surjective map of augmented rings, then #®p > 
#®r, by equation (5.2.1). But equation (5.2.3) gives the inequality #®; > 
#:(O/nr). Hence the inequality #®p > #(O/nr) always holds, so that (a) 
implies (b) in theorem 5.3. The reverse implication is clear. 


Remark 5.7 (Computing the tangent spaces ®4): Any object (A, ma) in Cg 
can be expressed as a quotient of the object U = O|[Xi,...,XnJ]] of ex- 
ample 5 with augmentation map given by my(f) = f(0). Indeed, one can 
take a ,... ,@, to be A-module generators of the finitely generated A-module 
ker 74, and obtain the desired quotient map by sending X; to a;. 

The tangent space ®y of U is a free O-module of rank n which can be 
written down canonically as 


OX, BOX2 ®--- BOX, 


the natural map from (kerry) being simply the map which sends a power 
series f € U with no constant term to its degree 1 term, which we will denote 


by f. 
If A is expressed as a quotient U/(f1,..., fr), then one has 


a = Oy/(fi,.-. fr): (5.2.9) 
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5.3 Complete intersections and the Gorenstein condi- 
tion 


In this section we show that complete intersections satisfy a Gorenstein con- 
dition, and that (c) implies (a) and (b) in the statement of Wiles isomorphism 
criterion (theorem 5.3). 


Definition 5.8 An object A in Co which is finite flat is said to be Gorenstein 
if 
Homo(A,O)~ A as A-modules. 


Proposition 5.9 Suppose A in Co is finite flat. If A is a complete intersec- 
tion, then A is Gorenstein. 


The remainder of this section will be devoted to proving proposition 5.9. Since 
the proof is a bit long and involved, and the concepts it uses are not used 
elsewhere, the reader is advised on a first reading to take it on faith and skip 
to the next section. A more direct proof of proposition 5.9 which bypasses the 
arguments of this section is explained in [Len]. 

We let A be a ring which is finite flat, and is a complete intersection. 
(Hence, A can be written as O[[X1,... ,XnJ]]/(f1,---,fn).) We assume that 
the augmentation map for A is induced from the map on O[[Xj,... , Xn] 
sending f to f(0). This implies that f;(0) = 0, i.e., the f; have no constant 
term. 

We recall some definitions from commutative algebra that we will need. 
An ideal J of a local ring R is said to be primary if I # R and every zero 


divisor in R/T is nilpotent. If (7,... , £n) generates a primary ideal of R, and 
n = dim R, then (21,... , £n) is called a system of parameters for R. 

Lemma 5.10 The sequence (fi,... , fn, A) is a system of parameters for U = 
OTA ty eg XM allt 


Proof: The quotient ring U/(fi,..- , fn, A) is local and is finitely generated as 
a k-vector space; therefore every element in its maximal ideal is nilpotent. 


A sequence (z1,... , £n) in a ring R is said to be a regular sequence if x; is 
not a zero-divisor in R/(x1,...,2;-1) fori =1,...,n. 


Lemma 5.11 The sequence (fi,..., fn) is a regular sequence for U. 
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Proof. The ring U is Cohen Macaulay, since A, X1,... , Xn is a system of 
parameters of U which is also a regular sequence. Hence, by theorem 17.4 (iii) 
of [Mat], (f1,..., fn, A) is a regular sequence in U. A fortiori, the sequence 
(fi,--- , fn) is also a regular sequence. 

The proofs of lemma 5.10 and 5.11 use only the fact that A is finitely 
generated as an O-module, and not that A is flat. As a corollary of this proof, 
we therefore have: 


Corollary 5.12 If R is an object of C% which is finitely generated as an O- 
module, and R œ O|[X4,... ,Xn}]/(f1,--- fn), then R is also flat, and hence 
is a complete intersection. 


To go further, we will introduce the Koszul complex 
K(z, R) = Bp-o0Kp(Z, R) 


associated to a local ring R and a sequence x = (21,...,2n) of elements in 
its maximal ideal. For more details on the Koszul complex and its relation 
to regular sequences, the reader can consult [Mat], especially §16, or [Bour] 
X, or [Se3]. This complex is defined to be the free graded differential algebra 
generated by symbols uy,... , Un: 


K,(z, R) = Diy <igcs<iplt Ui AeA Uip» 
with differential d : Kp —> Kp-1 defined by 
p 
d(ujz, Aiie A ti) = N -1t Ui Aeee A Uiga A Uig Atte A Uig: 
t=1 


We denote by H,(x; R) the homology groups of this complex. We record here 
the main properties of this complex that we will use. 


Proposition 5.13 (a) Ho(z; R) = R/(x). 


(b) There is a long exact homology sequence 


: — H(z; R) — Hp(a, £n; R) — Hpa(u; R) 2 


Hp-1(2; R) —> Hp-1(£, £n41; R) —> Hp-2(2; R) — -+ 


(c) H,(x;R) is annihilated by the ideal (x), i.e., it has a natural R/(x)- 
module structure. 
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(d) If x is a regular sequence, then H,(x;R) = 0 for all p > 0 (i.e., the 
complex K(x; R) is a free resolution of R/(x).) 


Proof: The first assertion follows directly from the definition. For (b) and (c), 
see [Mat], th. 16.4. The assertion (d) can be proved by a direct induction 
argument on n, using the long exact homology sequence: For p > 1, this 
sequence becomes 

0 — + Ay (2; £n41; R) — 0, 


and for p = 1, it is 
0 — H(z, %n41; R) — Holz; R) ZS Holz; R). 
But the assumption that £, £n+ı is a regular sequence means that multiplica- 


tion by 2,41 is injective on Ho(x; R) = R/(x). Hence, the assertion (d) follows. 
O 


Now, we turn to the proof of proposition 5.9, following a method of Tate 
which is explained in the appendix of [MRo]. For any ring R, write R||X]] := 
R||X1,... ,Xn]]- Let ai,...,@n be the images in A of Xj,...,X, by the 
natural map 


a: OX] — A= O[[X]]/(f,.-- 5 fn), 


and let 
b : Al Xe 


be the natural map which sends X; to a;. The sequence (g;) = (Xi — a;i) 
generates the kernel of 3. Since the f;, viewed as polynomials in A|[X]], also 
belong to ker 3, we have: 


(Siso fa) = (91,--+ s 9n)M, (5.3.1) 
where M is an n x n matrix with coefficients in A[[X]]. Let 
D = det(M) € A[[X]]. 
Our goal is to construct an A-module isomorphism 
Homo(A, O) — A. 
We begin by constructing a surjective (O-linear) map 


Homojxy(A[[X]], OXID — A. 
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Lemma 5.14 (Tate): The function ®(f) = a(f(D)) induces an isomorphism 
of O|[.X]]-modules 


Homo yxy (A[[X]], OXID- -- + Gn) — A. 


Proof: By lemma 5.11, the sequence (f) = (f1,.--, fn) is a regular O[[X]]- 
sequence. One can see directly from the definition that the sequence (g) = 
(gi) = (X;—a;) is a regular A[[X]|-sequence. Let K(f) and K(g) be the Koszul 


complexes associated to these two sequences. It follows from proposition 5.13 
that the Koszul complex K(f) is a resolution of A by free O[[X]|-modules, 


and the Koszul complex K (g) is a resolution of A by free A[[X]|-modules, and 

hence a fortiori, by free O[[X]]-modules. We define a map ® : K(f) — K(g) 

of complexes by letting : E 
®o : Ko(f) — Kolg) 


be the natural inclusion of O[[X]] into A[[X]], and letting 
®ı : K(f) — Kı (9) 
be the map defined by 
(ilui), ,Bilun)) = (1,222 , Un) M, 


and extending it by skew-linearity to a map of exterior algebras. One can 
check that the resulting map ® is a morphism of complexes which induces the 
identity map A —> A, and satisfies 


®, (Uy A+++ AUn) = D- vp A+++ A Up. 


Applying the functor Homgxy(—, O|[X]]) to these two free resolutions, and 
taking the homology of the resulting complexes, we find that since ® is a homo- 
topy equivalence of complexes it induces an isomorphism on the cohomology, 
and in particular, on the nth cohomology: 


Pn : Homoyxy(Al[X], OXIDO- 5 9n) = 


Homo yxy (OXI, OLA] /(fi,--- > fn) 
which is given explicitly by the formula: 


®,(f) = a(f(D)). 


K 


A, 


We finally come to the proof of proposition 5.9, which we can state in a 
more precise form. 
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Lemma 5.15 The map Y : Homo(A,O) — A defined by V(f) = a(f(D)), 
where f : A[[X]]| — O[[X]] is the base change of f, is an A-module isomor- 
phism, and hence, A is Gorenstein. 


Proof: The key point is to show that Y is A-linear. By definition, if 
a=a(a’) € A, with a’ € O[[X]], 
then 


W(af) = a(f(ad)) = a(f((a— @")D)) + alf (a D). 
Since a — a’ € ker 8, it can be written as an A[[X]]-linear combination of the 
gi- By multiplying the relation 


(Fixes oda) = (91, --- , 9n) M 


by the matrix D - M~! € M,,(Al[[X]]), one sees that the Dg; can be written as 
A|[X]]-linear combinations of the f;’s. Hence, so can the expression (a —a’)D. 
By the O|[|X]]-linearity of f, and the fact that each f; belongs to kera, it 
follows that 7 
a(f((a—a’)D)) = 0. 
Therefore, 7 
Waf) = a(a'f(D)) = aW(f). 
This shows that WV is A-linear. To show that W is surjective, observe that if 


fis... , fr is a O-basis of Homo(A, O), then fi,...,f, is a O[|X]]-basis for 
Homeyxy(A[[X]], CIX]. Hence, for all a € A, there exist p),...,p, such 
that - - 

On(pifi +++: + Prfr) = a. 


But this means that 


V(a(pi) fi ee a(pr) fr) = 4a, 


so that W is surjective. Finally, since Homo( A, ©) and A are free O-modules 
of the same finite rank, and W is surjective, it must also be injective. Hence 
W is an isomorphism, as was to be shown. o 


Example 5.16 Let A C O x O be the ring of example 1 in section 5.1. Then, 
f = T? — XT, and g = T — (0,A”). Hence, f = (T — (à”,0))g, so that 
D =T —(A",0). It follows that 


&(h) = a(h(T — (A",0))) = a(Th(1, 1) — h(à”,0)) = (0, A®)h(1, 1) — A(A”, 0). 


The reader can check directly that ® is indeed an A-linear isomorphism from 
the A-module Homo(A, O) to A. 
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Example 5.17 Let A = Ofe]/(e?) be the ring of example 3 in section 5.1. 
Then, f = T?, and g = T — e. Hence, f = (T +6)g, so that D = T +e. It 
follows that 


H= a TF) = ATIO = SEE 


Exercise 5.18 Write down explicitly an isomorphism A > Homo(A, O) for 
the complete intersection A = O|[X,Y]]/(X(X — A), Y (Y — A)) of example 4 
of section 5.1. 


5.4 The Congruence ideal for complete intersections 


Let A be an object of C, which is finite flat and is a complete intersection as 
in the previous section, so that A œ O[[X1,...,Xn]]/(fi,.-- 5 fn). 

Using the result of the previous section, we will give an explicit formula 
for computing 7,4 in this case, and prove that (c) implies (b) in theorem 5.3. 

Let AY := Homo(A, O), and let mă% : OY —> AY be the dual map. From 
the Gorenstein property of A, we may identify AY with A (as A-modules). Fix 
any identification V : AY —>+ A. (Any two such differ by a unit in A.) One 
checks that 

Wry (OY) = Annaker T4. 

Hence, ņa is the image of the map 74W7\. By using the explicit construction 
of Y given in lemma 5.15, we find: 


tar (OY) = nan(D), 
where D is the determinant defined in section 5.3. By a direct calculation using 


equation (5.3.1), one sees that the right hand side is equal to det(0f;/0X;(0)). 
Hence we have shown: 


Proposition 5.19 Suppose that A is an object of CQ which is finite flat. If 
A=O|[X,... ,Xn]]/(fi,---,fn) is a complete intersection, then 


na = (det(0fi/0X;(0))). 
This proposition implies: 


Corollary 5.20 Let A in C8 be finite flat. If A is a complete intersection, 
then #®, = #(O/na). Hence, the statement (c) implies statement (b) in 
theorem 5.3. 


Proof: The equation (5.2.9) of remark 5.7 implies that 
#24 = #0/ (det (ð fı/3X;(0))). 


The result follows. 
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5.5 Isomorphism theorems 


The usefulness of the notion of complete intersections comes from the following 
two (vaguely stated) principles: 
1. Isomorphisms to complete intersections can often be recognized by looking 
at their effects on the tangent spaces. 
2. Isomorphisms from complete intersections can often be recognized by look- 
ing at their effects on the invariants n. 

These vague principles are made precise in theorems 5.21 and 5.24 respec- 
tively. 


Theorem 5.21 Leto: A — B be a surjective morphism of augmented rings, 
with B a (finite, flat) complete intersection. If @ induces an isomorphism from 
®, to Bp, and these modules are finite, then o is an isomorphism. 


Remark 5.22 Let A = O|[X, Y]]/(X(X — A), Y (Y — A)) be the ring of ex- 
ample 4, let B = O|[X, Y]]/(X(X —A), Y (Y —A), XY) be the ring of example 
2, and let @ : A —> B be the natural projection. The map @ induces an 
isomorphism ®, —> pg, even though ¢ is not an isomorphism. The assump- 
tion that B is a complete intersection is crucial for concluding that ¢ is an 
isomorphism. 


Remark 5.23 The natural map 
OLX] /(X°) — O[[X]]/(X’) 


is a surjective morphism inducing an isomorphism on tangent spaces, and the 
target ring is a complete intersection. Yet this map is not an isomorphism. 
This shows that the assumption on the finiteness of the tangent spaces cannot 
be dispensed with. 


Proof of theorem 5.21: Recall that U = O[[Xi,... , Xn]] is the augmented ring 
of example 5 of section 5.1. Let 


vg :U — B 


be a surjective morphism of augmented rings with ker vg = (f1,..., fn). Let 
b,,... ,b, € ker mg denote the images of X1,... , Xn by vg, and let a1, ... , an € 
ker 74 denote inverse images of b1,... ,bn by @. Since @ is an isomorphism 
on tangent spaces, the elements a; generate (ker 7,)/(ker7,4)?. Hence the 
morphism 

va: O|[X1,...,Xn]] — A 
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defined by v4(X;) = a; induces a surjection Py —> 4, and so it is surjective, 
by lemma 5.5. 

We claim that ker vg is contained in kerv, (and hence, ker vg = ker va). 
For, let g1,... , gn be elements of ker v4 whose linear terms g1,... , Jn generate 
the kernel of 

Va. Py — 4. 


Since ker v4 C ker vg, it follows that there exists an n x n matrix M € M,,(U) 
with entries in U such that 


CIRES In) = (Fiss , fn) M. 


Let M be the matrix of constant terms of the matrix M. Then we have 


(Jis 2G = (Jo , fa) M. 


Since (g1, ... , Jn) and (fi,... , fn) generate the same submodules of rank n 
and finite index in ®y, it follows that det M is a unit in O. Hence, M is 
invertible, and therefore the f; can be expressed as a U-linear combination of 
the gj. This implies that ker vg C kervą. Now we see that vav" gives a 
well-defined inverse to @, so that ¢ is an isomorphism. 


Theorem 5.24 Leto: A — B be a surjective morphism of augmented rings. 
Suppose that A and B are finite flat, and that A a complete intersection. If 
na = Np #0, then ¢ is an isomorphism. 


Remark 5.25 The torsion-freeness assumption on B is essential: if n is large 
enough, then B = A/(ker t4)” satisfies 74 = ng, although the natural map 
A — B is not injective when A Æ O. 


Proof: By proposition 5.9, A is Gorenstein, i.e., it satisfies 
AY := Homo (A, O) ~ A as A-modules. 
Now, we observe that 
ker ra N Ann yker 74 = 0, (5.5.1) 


and likewise for B. For, let x be a non-zero element of 7,4, and let 2’ € 
Annaker 74 satisfy 74(2’) = x. For all a € ker r4 N Annyker 74, we have 


0 =a(z — oe"): ar 
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the first equality because a belongs to Annykerz, and (x — 2’) belongs to 
kerra, the second equality because a belongs to kerma and x’ belongs to 
Annyker ra. Hence a belongs to the O-torsion submodule of A, and therefore 
is 0. 

It follows from (5.5.1) that the homomorphism 7,4 (resp. mg) induces an 
isomorphism from Annyker zy, (resp. Anngker mtg) to na (resp. ng). Since na 
is isomorphic to 7p, it follows that ¢ induces an isomorphism from Ann yker 74 
to Anngker Tp, i.e., 

gAnnyker 74 = Anngker zz. 


From (5.5.1) it also follows a fortiori that 
ker @N Annyker 7,4 = 0, 
hence there is an exact sequence of A-modules: 
0 — ker @ @ Annaker t4 — A. (5.5.2) 
The cokernel of the last map is 
A/ (ker ọ @ Annyker 74) ~ B/(ġAnnaker 7,4) ~ B/(Annpgker 73), 
which is torsion-free, since there is a natural injection 
B/(Annpgker 7g) —> End o(ker 7p). 


Hence, the exact sequence (5.5.2) splits over O. Taking O duals in (5.5.2) 
and using the Gorenstein condition for A, we thus get an exact sequence of 
A-modules: 

A — (ker) @ (Annyker 74)” — 0. 


Applying the functor — ®4 k (relative to the map A — k), we find 
1 = dim;(A 84 k) > dim,((ker ¢)” @4 k) + dim, ((Annaker 74)’ 84 K). 
Since n4 Æ 0, it follows that (Annyker r4)“ @4k £ 0, and hence we must have 
(ker 6)’ 84k =0. 


Therefore by Nakayama’s lemma and duality, ker@ = 0, which proves the 
theorem. 
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5.6 A resolution lemma 


It turns out that objects in C% can be “resolved” (in a weak sense) by a 
complete intersection, namely, 


Theorem 5.26 Let A be an augmented ring which is finite flat over O. Then 
there is a morphism A — A in the category C& such that: 


(a) the ring A is finite flat over O and is a complete intersection; 


(b) the map A — A induces an isomorphism ©; — ® 4. 


Proof: Write A as a quotient of U = O[[X1,... , Xn]] (with my : U — O the 
map which sends each X; to 0.) Let f1,... , fn be elements in the kernel of the 
natural map U — A, such that fi,... , fn generate the kernel of Py — P4. 
Now letting N 

Ä= Ufi., fa) 


would give the desired ring A, provided A is finitely generated: indeed, the 
flatness of A would follow from corollary 5.12. 

Thus we need to show that the f; can be chosen so that A is finitely 
generated. Let a ,,...,@, be O-module generators of the finitely generated 
module ker 74, and define a homomorphism ¢ from the polynomial ring 


VSO ces Xn 


to A by sending X; to a;. Clearly ¢ is surjective. Let fı, ..., fn be elements 
of ker chosen as above, and let m denote their maximal degree. Since the 
elements a? belong to ker 74, we may write 


a? = hi(ai, . eo ‘One 


where h;(X1,..., Xn) is a linear polynomial. Now, replacing the relations f; 
by the relations 

fit XPhi — XP, 
we find that the ring V/(fi,... , fn) is a finitely generated O-module: it can 
be generated by the images of the monomials of degree < n(m + 1), since the 
relations allow us to rewrite any monomial of higher degree in terms of ones of 
lower degree. Completing at the ideal (A, X1,... , Xn), we find that the ring 


AEG cache) 


has the desired properties: the natural homomorphism from A to A induces 
an isomorphism on the tangent spaces, since the linear terms of the f; generate 
the kernel of the induced map Py —> 4 on the tangent spaces, and A is a 
finitely generated O-module, since V/(fi,... , fn) is. 


152 


5.7 <A criterion for complete intersections 


The results we have accumulated so far allow us to give an important criterion 
for an object A to be a complete intersection: 


Theorem 5.27 Let A be an augmented ring which is a finitely generated 
torsion-free O-module. If #®4 < #(O/na4) < œ, then A is a complete inter- 
section. 


Proof: Let ¢ : A — A be the surjective morphism given by the resolution 
theorem (theorem 5.26). Then we have 


#(O/na) = (#94) = (#04) = #(O/N4), 


where the first inequality is by assumption, the second by the choice of A, and 
the third is by the equation (5.2.3). On the other hand, by equation (5.2.2), 
we have 


#(O/na) = #(O/na). 
It follows that 
NA =A; 
so that @ is an isomorphism by theorem 5.24. It follows that A is a complete 
intersection. 


5.8 Proof of Wiles’ numerical criterion 


Theorem 5.28 Let R and T be augmented rings such that T is a finitely 
generated torsion-free O-module, and let ġ : R —> T be a surjective morphism. 


If 
#®r < #(O/nr) < 00, 


then R and T are complete intersections, and ¢ is an isomorphism. 
Proof: We have: 


#(O/nr) < #®r < #Or < #(O/nr), 


where the first inequality is by equation (5.2.3), the second follows from the 
surjectivity of ¢, and the third is by hypothesis. Therefore, 


#®r = #(O/nr), 


and hence T is a complete intersection by theorem 5.27. Since the orders of 
pr and r are the same, ¢ induces an isomorphism between them. Hence @ 
is an isomorphism R — T, by theorem 5.21. This completes the proof. 
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Theorem 5.28 shows that the statements (a) and (b) in theorem 5.3 imply 
the statement (c). Combining corollaries 5.6, 5.20, and theorem 5.28 completes 
the proof of theorem 5.3. 


5.9 A reduction to characteristic £ 


Let C be the category of complete local noetherian k-algebras with residue 
field k. Again all morphisms are assumed to be local. There is a natural 
functor At A from Co to Cp which send A to A := A/A. 

We say that an object A of Ck which is finite dimensional as a k-vector 
space is a complete intersection if it is isomorphic to a quotient 


A = Goes iia 


Note that if an object A of Co is a complete intersection, then A is a 
complete intersection in C. As a partial converse, we have: 


Lemma 5.29 Suppose that R — T is a map in the category Co, and that T is 
finitely generated and free as an O-module. Then R —> T is an isomorphism 
of complete intersections, if and only if R — T is. 


Proof: This is an exercise and is left to the reader. 
We now come to the proof of lemma 3.39 of section 3.4: 


Lemma 5.30 Suppose that K C K' are local fields with rings of integers 
O C O' and that A is an object of Co which is finitely generated and free as 
an O-module. Then A is a complete intersection if and only if A 80 O' is. 


Proof: One implication is clear. Let k and k’ be the residue fields of O and O’ 
respectively. By lemma 5.29 it is enough to prove that, if R is an object of Ck 
which is finite dimensional as a k-vector space, then 


R' = R®,;k’ is a complete intersection = R is a complete intersection. 


Let m and m denote the maximal ideals of R and R’ respectively. 

By assumption, we have R’ = k’|[Yi,... , Y;]]/J, where the ideal J can be 
generated by r elements. We can assume without loss of generality (by adding 
extra variables and relations if necessary) that the images of Y1, ... Y, generate 
m as a k’-vector space. Now, let 


@:k{[X,...,X,]] OR 
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be a ring homomorphism, such that the images of X1,... , X, generate m as 
a k-vector space, and let 


o: kX., X] — R 


be the extension of scalars. Let J and I’ = I @; k’ be the kernels of these 
two maps. We claim that J’ can be generated by r elements. (In fact, this is 
also true without the assumption that the images of X1,... , X, generate m’ 
as a k’ vector space, although we use this assumption in the proof below. Cf. 
remark 5.2.) To see this, choose an isomorphism of k’-vector spaces k'Xı ® 
+--+ BAX, — K'Y. @--- @ k'Y, such that 


k'Xı ara — kY oe @ K'Y, 


m = m 


/ 
commutes. Extending this map to an isomorphism of k’-algebras, 
v: k'[[Xy,...,Xp]] — KIY,- Y], 


one sees that I’ = v~'(J), and hence can be generated by r elements, as 
claimed. 
In particular we have dim, (I’/m'I’) < r, and 


dim, (I/mI) = dimy ((I/mI) @, k’) < dimy (I’/m’T') <r. 


Nakayama’s lemma now implies that J can be generated by r elements, and 
hence R is a complete intersection. This proves the lemma. oO 


5.10 J-structures 


We now turn to the proof of theorem 3.41. In view of the last section we will 
work in characteristic 4. Thus let 7: R —> T be a surjective morphism in the 
category Ck, where R and T are finite dimensional as k-vector spaces. Let r 
be a non-negative integer. If J dk{[S1,...,.$,]] and J C (S1, ..., 57) then by a 
strong J-structure we shall mean a commutative diagram in Ck 


kllsa, Sel] 
EA XJ] Ro oT’ 


k +t 


such that 
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~N 


(a) T'/(S1, -ST >T and R'/(61, .., S) R' —> R, 
(b) for each ideal J D J, I = ker (k[[S1, --; S]] > T/D, 
(c) J = ker (A[[Sj,...,5,]] > 7”) and R' OT’ @R. 


We refer to these as strong J-structures because we have slightly altered the 
conditions from section 3.4 for technical convenience in the rest of this section. 
We will prove the following result. 


Theorem 5.31 Suppose there exist a sequence of ideals Jn <k[||S1, ...,S,|] such 
that Jo = (Sissy Sr), Jn D Jn+1; (Àn Jn = (0) and for each n there exists a 
strong J,-structure. Then RT and these rings are complete intersections. 


Before proving theorem 5.31 we shall explain how to deduce theorem 3.41 
from it. If J is an ideal of O[|[S1, ..., S,]] we will use J to denote its image in 
k[[Si,...,S,]]. If S given by 


Ol Maca b or 


is a J-structure for R — T, let a = ker (R — T), and let R” denote the ring 
Im (R' > (R/mpra  (T'/J)) 80 k). Then S given by 
k[[S1,---, S-]| 
k[[X1, ..-, Xr]] —> R" — (T'/J) 80o k 
(R/mra) 80 k —> Tok, 


is a strong J-structure for (R/mga)@ok > T@ok. If Jn is a sequence of ideals 
as in theorem 3.41 then J, is a sequence of nested ideals with Jo = (S1, ...,.5;) 
and (),, Jn = (0). Moreover if for each n there is a J,,-structure S,, for R > T 
then for each n there is a strong J,-structure S,, for 


(R/mpa) 80 k > T Q0 k. 


Then by theorem 5.31 we see that this map is an isomorphism of complete 
intersections. Lemma 5.29 shows that theorem 3.41 follows. 

Before returning to the proof of theorem 5.31, let us first make some re- 
marks about strong J-structures. 
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e The set of strong J structures for all ideals J C (S4, ..., Sp) C k[[S1, ..., S|] 
forms a category, with the obvious notion of morphism. 


e If Sis a strong J-structure and if (S},...,S,) D J’ D J then there is a natural 
J'-structure S mod J’ obtained by replacing T” by T’/J’ and R’ by the image 
of R' > R@(T'/J'). 


e If R is a finite dimensional k-vector space and if J has finite index in 
k{[Si,...,.5,]] then there are only finitely many isomorphism classes of strong 
J-structure. (This follows because we can bound the order of R’ in any J- 
structure. Explicitly we must have #R' < (#R)(#Kk[[S1, ..., S,]]/J)tm T.) 


Lemma 5.32 Suppose that R is a finite dimensional k vector space. Suppose 
also that {J,} is a nested (decreasing) sequence of ideals and that J = p Jn. 
If for each n a strong Jn structure exists then a strong J structure exists. 


Proof: We may suppose that each J,, has finite index in k[[\Sj,...,5,]]. Let Sn 
denote a strong J,,-structure. Let Sn,m = Sn mod Jm if m < n. Because there 
are only finitely many isomorphism classes of strong Jm structure, we may 
recursively choose integers n(m) such that 


© Srim)m = Snm for infinitely many n, 
e if m > 1 then So caw. Ong 


Let S, = nian 
Sl, mod Jm, = Sh 


structure. 


Then Sl, is a strong Jm structure and if m > m, then 
One checks that S = lim S, is the desired strong J- 


1° 


Lemma 5.33 Suppose that a strong (0) structure exists. Then the map R — 
T is an isomorphism, and these rings are complete intersections. 


Proof: Because k|[S1, ...,S;|] — T’ (and T” is a finitely generated k[[S1, ..., S;]]- 
module by Nakayama’s lemma, cf. [Mat], thm. 8.4) we see that the Krull 
dimension of T” is at least r. On the other hand &/[X1,..., X;,]]| —> T” and so 
by Krull’s principal ideal theorem this map must be an isomorphism. Thus 


K[[X1, .--) Xr] Z RST. 
Hence we have that 


~N 


E E CS ae Fe Shae oe 


and the lemma follows. 
Theorem 5.31 follows at once from these two lemmas. oO 
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